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I. INIEOEOC TION 

PCX the remainder of this decade and beyond, the Navy is 
faced with the diffictlt problem of attracting and retaining 
sufficient personnel to meet its ever increasing manpower 
reguirements. The fleet is expanding toward 600 ships while 
the available manpower pool of 17 to 21 year old men and 
women is projected tc decline. Each year, millicns of 
dollars are spent recruiting, training and maintaining 
enlisted personnel. Numerous enlistment standards models 
have beer developed tc improve the screening, selection and 
assignment processes for all Navy ratings. Ccrtiruing 
enlistment standards research is important since it may 
improve manpower planning, reduce attrition, enhance job 
performance, and lengthen careers. It is through such 
research' that the ultimate goal of increased readiness at 
lower cost nay be realized. 

a, IDIfCSE OF IKALYFIS 

Ihis study attempts firstly to improve upon the methcd- 
clogy presently utilized to develop enlistment standards 
models. In particular, different techniques for developing 
such models are presented, along with alternative criteria 
for measuring successful performance. A means of measuring 
the utility, or usefulness, of such efforts is also 
provided. an attempt has been made to present these methods 
in a clear manner sc that those researchers who follow may 
more readily understand the process. The analysis expands 
upon the experience of numerous similar efforts, including 
several graduate theses prepared at the Naval Postgraduate 
School and many research projects conducted under the 
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ausfic€s cf the Personnel Eesearch and De velc^ineat 
Centex (KPETC) and the Center for Naval Analyses (CNA) . 

Ihe seccndary purpose cf this study is tc discover if 
the selecticn standards for cne particular Navy rating ray 
be impicved upon by axalyring data available at the tine of 
enlistuert. Most predictive rodels developed to date have 
focused cn successful conpletion of technical training 
schools, cr on attrition. Ihis study is aligned with a mere 
recent analytic trend of attempting to predict successful 
cper at i cr al performance in the fleet. This approach is 
considered most appropriate in today’s highly technical 
Navy. The tremendous cost, in terms of both dcllars and 
time, associated with training and retaining Navy perscnnel 
demands raximum return on investment. By focusing cn opera- 
tional success tc develop a larger, more experienced career 
force, there exists the potential to reduce the burden of 
recruiting and training new enlistees. 



E. EATING EELECTED ICE ANAIYSJS 

Tc acccaplish the above stated purposes, data pertaining 
to operational performance of personnel in the Aviation 
Machinist’s Mate (AE) rating were analyzed. ACs are 
aircraft engine mechanics who inspect, adjust, test, repair 
and overhaul engines used in all Navy airplanes and helicop- 
ters. AEs also perform routine maintenance, prepare 
aircraft for flight, and assist in handling aircraft cn the 
ground cr aboard ships. They perform maintenance and 
servicing on either jet or reciprocating engines, ard on 
subsystems such as fuel, oil, induction, compression, 
combustion, turbine and exhaust. Other AD functions include 
supervising maintenance, analyzing fuel and oil samples, 
keeping records, evaluating engine performance, and main- 
taining accessory components, drive systems and gear boxes. 
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flviaticii MacJiinist’s Mates are assigned primarily to 
Naval Aviation sguadrons or to Aircraft Intermediate 
tainterarce Departmeits. Ihese assignments may te eitler 
afloat cr ashore. AIs may also be assigned as instructors 
in trainirg activities, and they are eligible to volunteer 
for flight duty as aircrewmen, [Ref. 1] 

Eresently, there are over 13,000 men and women assigned 
to the AE rating. Since ADs are assigned to virtually every 
Navy aviation unit, they represent a vital element in 
ensuring a high degree of aircraft readiness is mairtained. 
As such, the overall mission effectiveness of Naval Aviation 
units is directly linked tc the quality and performance of 
memhers cf the AE rating. 



C. CIGANI2ATION OF lEIS STODY 



Ihis chapter has discussed the purpose cf this study, 
and described the AD rating and its importance to the Navy. 
Ihe next chapter will provide background informaticr on 
enlistmext standards research, and present in general terms 
the evcluticn cf predictor and criterion variables that 
emerged during the development of the models contained in 
this research. Chapter III describes the data base and AD 
data set that provided specific measures of operational 
performance for analysis and model formulation. Chapter IV 
presorts the three statistical analysis techniques em;ployed 
in developing six enlistment standards models. Chapter V 
provides the method and results of the utility analysis 
conducted on these models. Utility analysis represents a 
means by which the usefulness of similar efforts may be 
measured. Chapter VI draws conclusions from the analysis 



and presents reccmmencations 



for further research 



II. £ £IECT ICN OF 7ABIAF1ES 

Itis ciapter gi^€s a brief description of some cf tbe 
selecticr procedures in use at the time of the data collec- 
tion. Ihe results cf previous research on predicting job 
performance are reviewed and ’ predictor and criterion vari- 
ables tiat have been shown to be useful are identified. 



A. SIIECIICM BACKGECCND 

At the time tie data used in this analysis was 
collected, the Navy considered a number of applicant charac- 
teristics tc guide selection and classification decisions. 
These characteristics included education, high school degree 
status, age, number of dependents, scores on the 12 Armed 
Services Vocational Aptitude Battery (ASVAB) subtests, and 
some composite scores. The Armed Forces Qualification Test 
(AFQT) is cne of these composite scores, and an applicant’s 
score on the AFQT depended on the sum of his scores on the 
ASVAE suhtests Arithmetic Reasoning (AR) , Spatial Perception 
(SP) , and Herd Knowledge (W K) . The AFQT score was reported 
as a percentile — a score of 80 meant that an applicant's 
total score on the three subtests was higher than the scores 
achieved by 79 percent of his peers. The AFQT percentile 
score was also used tc classify the applicant into cne of 
five mental categories or AFQT groups. For example, those 
with a score of SO or better were classified in mental group 
I, and those with a score cf 10 or less were classified as 
group Vs. 

Another composite score is the Success Chances of 
Recruits Entering the Navy (SCREEN) score- This score is 
derived from the personal characteristics of age at entry. 
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j€axs of sctooliug, whether cr cot the applicant had d 
dents, and AFQT percentile score. This composite sccr 
teen used ty recruiters since Cctoter 1976 in determini 
applicant's eligibility to enlist- £He£. 2] 

A final composite score that is used in classi 
recruits tc the AD rating is made up of the sum cf 
recruit's standardized scores on the ASVAB sut 
Arithmetic Reasoning (AR) , Electronic Information 
General Science (GS) , and Mathematical Knowledge (MK) 
minimum score of 19C on this composite was necessary 
recruit to enter the AB rating. 



E, EIVIEJS CF PBEVIOCS MILIIAEI STUDIES 



Studies on predicting military job performance 
mainly ccncentrated cn the survivability of recruits th 
various stages of their military careers. These s 
include recruit training. Class "A” School, first two 
cf enlistment and first term of enlistment. 

lurie used number of dependents, years of educatio 
AFQT score to predict the performance of the Electro 
Technician <ETN) and Ship' s Serviceman (SH) ratings, 
found that for the SE rating, non-higii school graduates 
lower AFQT scores were promoted faster than those 
higher scores, although AFQT score had no impact cn 
term survival. Tie AFQT score did aid in predi 

advancement and survival for members of the ETN ta 
[Ref. S] In another study of eight year survival r 
lurie found that education level was the most impc 
predictor. Interestingly he also found that mental gr 
recruits had the worst record in surviving Class "A” Sc 
[Ref. il] 

A study on the factors affecting first term s urviva 
reterticr behavior cf Machinist's Mates (MM) and E 
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lechDicians (BT) was cond 
found that Els with create 
greatly impioved charces o 
enlis tnent . For AMs, those 
groups had greater survive 
ioth ratings, older men 
survival. In relaticn to r 
more years cf education ha 
ment, while with MAS, havin 
ability cf reenlis tmert. [ 

Studies of military jo 
gated the effect of the 
survival. Lockman found t 
ured by SCEEEN) and trainin 
those whc were in tie LEP 
highest survival rates £ Eef 
age, education, recruit tra 
dependents, mental group a 
varying degrees of signifi 
sur vi vatility £Eef. 

Mere recent studies ha 
rather than single measure 
because it is rare that a 
the full scope cf jcb perfo 
expand tie sur vivability cr 
cf jcb performance, such a 
achievement of certain payg 
not availahle under this 
categorized as a success or 
and Snyder and Bergazzi £Be 
success cr failure in their 
more variability on the cri 



ucted by Fletcher in 197S. He 
r than 11 years of education had 
f surviving their first term of 
in the lowest and highest mental 
1 probability than others. Per 
had a higher probability of 
eenlistment, those BTs with 12 or 
d a lew probability of reenlist- 
g a dependent increased the preb- 
Bef. 5] 



b performance have also investi- 
Celayed Entry Program (DEP) on 
hat if recruit quality (as meas- 
g guarantees were controlled fer, 
for three or more months had the 
. 6 j. Thomason found that DEP, 
ining location, race, number of 
rd follow on tour assignments had 
cance in determining first’ term 



ve favored the use of multiple, 
s cf job performance. This is 
single measure adequately cevers 
rmance. One approach has teen to 
iteria to include other measures 
s eligibility to reenlist and the 
rades. A continuous criterion is 
approach; sailors are either 
as a failure. Nesbitt £Eef. 8] 
f. 9] defined various degrees of 
studies in an effort to generate 
ter ion . 
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CEaiJEICN ANE PfiiEICTOB VAEIIBLES 



Ii Host cases fchen a siigle job performaace measure 
(criterion) has teen used in previous research, a measure of 
survival has been the overwhelming choice. Ihis is because 
such a criterion is readily measurable, is continuous, and 
is of importance to the Navy since tne costs associated with 
attrition and subsecuent replacement are considerable, 
ether single criteria have teen length of service (iO£) , 
time to pro notion, highest rank cr grade achieved, retention 
(as measured by reerlistment choice), and performance at 
Class "a" Schools. 

Ihe cemnon predicters of jet performance are education, 
number of dependents, age, sex, race, ASVAB subtest scores, 
AFQI scores, mental group, DEP status, and some "after 
accession" variables such as recruit training location, 
subsecuent dependent status, and initial and follow on duty 
assi g r mert s . 

In this study two criteria will be considered. Ihe 
first kill be an ICf criterion and the second will be a 
composite criterion where success will te defined as 
completing the first term of enlistment, being eligible for 
reenlistment, and achieving the paygrade E-4. The candidate 
predictor variables will be age at entry, sex, race, entry 
paygrade, education, dependent status, term of enlistment, 
ASVAE subtest scores, AFQT scores and the composite score to 
qualify for the AD rating. Ihe specific variables from the 
AD data set used for analysis, as well as the evolution of 
the data set, are discussed in the next chapter. 
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III. ^TA EASi D E7EL QPMEN'T 

liis chapter pro'sides infor nation concerning the naster 
data tase and the setset of this master file, the AC data 
set, that was used in this study. The generation cf this AD 
data set is descrited in detail, as are the specific 
predictor and critericn variables discussed in the previous 
chapter . 



A. 



HASIEB EIIE 



Erlisted history records on over 206,000 ncn-prior 
service sailors who entered the Navy during the period 1 
Eeptenter 1976 to 21 December 1978 were compiled by the 
Defense Eanpower Data Center (DMDC) staff. This master file 
was created by merging data elements from four separate 
files; (1) the DMDC Cohort file, which is itself a compila- 
tion cf infcrmation from DMDC*s Enlisted Master Reccrc and 
loss files; (2) a Navy Health Research Center (NHRC) file; 
<3) a prcmction examixation file; and (4) a Chief cf Naval 
Educaticr and Training (CNET) file. 

Tie lire Cohort file contains personal and demographic 
data or individuals at the time they entered the service. 
Additionally, it is updated quarterly by the Military 
Personnel Commands with active duty or service separation 
information as appropriate. This file provided over 150 
variables to the master file. 

Tie NHFC file contains information on each enlisted 
member of tie Navy who has teen or still is on active duty. 
It is updated quarterly from Navy Military Personnel Command 
(NMPC) change tape extracts, and follows a service memter 
from date cf enlistment to date of discharge. The NKRC file 
represents approximately 30 variables in the master file- 



16 



Ibe f£CEOtioi3 exaninati cn file includes advancement eiam 
and ficmcticn data, and the CNEI file contains inf cr mation 
cn fciiral training received bj individuals in the data base, 
logetier these files provided over 60 variables to the 
master data base. 

Ihe master file, containing 243 variables, is maintained 
at the Naval Postgraccate School. The final update tc the 
file includes DMCC data current as of 30 September 1S62, and 
KHRC data current as of July 1982- Ihe program tc access 
data cn the selected rating is contained in Appendix A. 



E. AI EAIA SET 

This section describes 
that contains the observati 
study. The AD data set 
iterative screens, ard new 
to remcve aberrant observat 
and criterion variables pri 
iterative steps ultimately 
the AD data set frcm 5,5 
programs used to screen the 
and criterion variables ar 
logic for these processes 
this chapter. 

1 . Scr een s 

Since one purpose 
Aviation Machinist's Mates 
the fleet, the first sere 
select ctly those cases wh 
appeared in the last master 



tie evolution of the AD data set 
ens and measures analyzed in this 
was derived through a number of 
variables were created, in order 
ions and to refine the predictor 
cr^ to statistical analysis. These 
reduced the number of cases in 
62 to 2,820 observations. The 
data and to create new predictor 
e contained in Appendix A. Ihe 
is discussed in the remainder of 



of this study was tc analyze 
whe had operational experience in 
en performed on the data was to 
ose final DMDC rate (DMECBAIE) ^ 
file update as ADs. This means 



fThe actual variable name associated with tha comment is 
provided in parentheses throughout the remainder cr this 
chapter . 
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they sere Korkiig it the 
their separation frci the 
last tile update if they w 
screen allowed for people 
while ensuring that those c 
were eaciuded from the anal 
Ihe cases were nex 
Navy service (PRIOESB^) . 
have charged their social 
removed fron the sanple. 
observations were courted m 
Ihe observations in 
to select crly those peopl 
5, 6, or 7 (lESIfORh) at en 
in use during the period 
data set erlisted. Also, 
(ASVAEs) were impossibly h 
set . 



AD rating at either the tine of 
service, or at the time ct the 
ere still on active duty. Ihis 
to migrate into the AD rating 
ases who left for another rating 
ysis. 

t screened for ADs with rc prior 
In addition, individuals who nay 
security number (SSNCRNGEj were 
These screens ensured that no 
ore than once in the analysis. 

the AD data were then screened 
e who were tested on ASVAE Eorms 
listment. These test forms were 
in which the individuals in the 
those cases whose subtest scores 
igh were eliminated from the data 



Two other screens were conducted to capture those 
observations who enlisted in either the Regular Navy or 
Naval Reserve (SERVACCS) , and who were known to have signed 
up for at least four years act ive service (RECENLST) . It is 
worth loting that during model development, the term of 
enlistment measure (TEEhENLT) was consistently significant, 
hut with a negative value for the parameter estimate. Ibis 
indicated that those individuals who enlisted for longer 
obligated service actually served less time than those who 
signed up for shorter terms of enlistment. The parameter 
estimate for term of enlistment changed to a positive value 
when the RECENLST screen was implemented. Apparently, by 
screening for four year active duty obligors, the data set 
then excluded those reservists who were required to serve 
only three years of their six year obligation on active 
duty. Fox these observations, a six year term of enlistment 



18 



lias ac errcneous value for the TEEMENLT variable. Ihis 
importert discovery reveals a probable flaw in earlier 
similar erlistmert standards analytic efforts. 

Ancther screen facilitated inclusion of those cases 
for viici there was clear indication of their eligibility to 
reenlist (ELGBEDP1 cr ELGEEDP3) . The final screen in 
setting up the AD data set included cnly those separated 
individuals who could be easily identified by "gccd" cr 
"bad" interservice separation codes (ISC) . Observations 
with separation codes in the "grey" area (death, hardship 
discharge, entry into officer programs, or medical disguali- 
ficaticn) were removed from the data set since it was felt a 
legitimate deter mina ticn of their success or failure could 
not be made. 

Having incorporated these screens, freguency distri- 
huticn analysis facilitated the removal of aberrant or 
impossible cases. lor example, the maximum length of 
service tetween 1 September ^S16 and 30 September 1982, the 
period of the data base, was 72 months. Cases whc were 
listed as having greater than 72 months LOS were removed 
from the data set. 

2 . Cr e ate d Variable s 

This discussion identifies the variables created in 
addition to those already in the master data base. Creating 
these variables facilitated more detailed analysis of obser- 
vations in the AD data set, and enhanced the enlistment 
standards model development process. The following comments 
will also address the reasons for selecting some variables 
and not others. 

a. Predictor Variables 

There were several ways that individuals ir the 
master data base might appear in the AD rating during their 
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career. Ihey aay have enlisted in a program to beccne an 
AD, taken the AD rating exam, and/or achieved the AC rating 
through cn the job tiaining. To distinguish betvieen the 
varices combinations of these processes, an entry group 
variable (EKIEYGEP) ias created. Freguency analysis of this 
new variable ccnfiined that the final DMDC rate of AD 
screened and selected only those cases who actually ended up 
as ADs. An effort was made to develop models for various 
combinations of these entry groups during stepwise regres- 
sion analysis. However, the derived models were not signif- 
icant, and they did not improve upon the models ultimately 
selected for analysis. 

A common predictor variable in enlistment stan- 
dards Dcdels is cne dealing with education. The measure in 
the master data base reflecting education level (HYEC) was 
converted from a gualitative value to a continuous variable 
(CHYEC) to facilitate statistical analysis. In addition, a 
dichctcDcus (0,1) variable was created to reflect attainment 
cf a high school degree (HSDG). During stepwise analysis, 
which is discussed in the next chapter, each of these two 
new variables was offered separately as a candidate 
predictor variable. In nearly every instance, KSDG was 
shown tc be more significant than CHYEC. 

Other cojiacn predictor variables which measure 
entry-level attributes are ASVAB subtest scores. Ic allow 
the use cf these measures of individual characteristics, the 
scores were standardized, and the created variables 
(SASVAEs) were considered during model development- As 
mentioned in Chapter II, standardized ASVAB subtest scores 
are used in various combinations as composite measures. Cne 
cf these composite variables is AFQT percentile (AFClfCNI) , 
which also yields AFCI groups (AFi^TGRPS) . Another cempesite 
is the score used to determine eligibility fer the AD 
ratine. Two variables were created in the AD data set to 
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identify tiis latter composite measure. Ihe first varialla 
created (AECOMPOS) was a continuous variable which had a 
value eoual to the sum of the four ASVAB standardized 
subtest scores that make up the composite. The second vari- 
able created (ADMINSCE) was a dichotomous variable which 
distixguished those AICOMPOS values greater than or egual to 
190 from these ACCOMICS values less than 190. Each. time one 
of ttese four composite measures was offered as a candidate 
predictor variable during regression model develcpment, 
three separate trials were run. One trial contained the 
composite measure ard all 12 SASVAB variables. Another 
trial contained the composite variable and only these SASVAB 
variables that did net make up the composite variable. The 
third trial contained only the composite measure with no 
SASVAB variables. Additionally, the trials contained either 
AFQIPCKI or AEQTGRPS, and either ADCOMPOS or ADiilNSCE. Ihe 
purpose ci this iterative process was to ensure multicclli- 
nearity effects were minimized among the independent vari- 
ables. During the development of the regression models, 
AFQTPCNT and ADhlNSCE were consistently shown tc be mere 
Significant than AFC5GRPS and ADCOMPOS respectively. for 
this reason, they were included among the final candidate 
predictor variables used in stepwise regression analysis. 

Another predictor variable commonly considered 
by enlistment standards 'research deals with marital status 
and dependents. The master file contains a qualitative 
variable (METLDPND) which reflects marital status and number 
of dependents. This study created a dichotomous variable 
(DEPFKDTB) which distinguishes single individuals from these 
who are married and/cr who have children. Again an itera- 
tive process revealed this created variable to consistently 
be mere significant. 

The effects of race and sex were also considered 
in tie analysis by creating new variables. The best 



variatle iu the master file to indicate race acd ethric 
status identified categories cf whites, hlacks and ethers 
(EACE). Since this variable was gualitative, three dumiy 
varieties were created (WHITE, BLACK, and OTHEE) . Ic allow 
analysis of the effects of sex, the master file variatle 
(SEX) was converted tc a "0,1" variable (NUSEX) , 

Several ether predictor variables were cccsid- 
ered and tested for significance and possible inclusicn in 
the final set cf candidate predictor variables prior to 
developing the regression models. Age at enlistment 
(ENTEXAGE), enlistment paygrade (ENTEPAXG) and term of 
enlistmert (lEBMENLT) were among those selected. Hany vari- 
ables were rejected because other measures were tetter able 
to capture the desired effects. One particular variable 
which did not show to be significant was the composite 
SCHEEK variable (SCEEEN) discussed in Chapter II. This may 
be because the components cf the SCEEEN variable are indi- 
vidualiy mere appropriate for analysis, particularly when 
the emphasis is cn predicting operational performance in the 
fleet. Similar results were cited by McGarvey [Eef. 10]. 

Ihe final set of predictor variables created in 
the AI data set are interaction terms. These variables 
represent all twc-level interactions of the seven variables 
that met the specified significance level during stepwise 
regression analysis. The development cf these variables is 
discussed in more detail in Chapter IV. 

b. Criterion Variables 

As discussed in Chapter II, this study used two 
critericr variables when developing the six models — length 
cf service measures and success measures. Tne length of 
service measure for regression models is a continuous vari- 
able (TAEMSI), and fer discriminant models is a dichctcmcus 



variatle (£CCCTAf).2 5UCCTAf has assigned a value c£ ‘one’ 
if the value of lAIfSI was greater than or egual tc 48 
months, or if the value of lAfMSI was greater tnan cr egual 
to 45 months and the individual entered the Navy in Cctccer, 
Kovemter cr December 1978 (lAIEENLT) . This was done to 
ensure those cases who did not have the op p or tun it y tc serve 
48 mcrths were net injicperly classified as failures. 

Individuals were considered as successes, for 
furjeses of this analysis, if they served 48 mcrths cr 
longer, achieved paygrade E-4, and were recommended for 
reenlistment. Again, observations who did not have the 
opportunity to serve 48 months were also considered 
successful on the ICS portion of this criterion if they 
served at least 45 mciths. Ihe success criterion variafls 
(S0CCESS2) captures these measures by considering SUCCIAF 
and twe other created variables (SUCCPAYG and SUCCEEUP) . 

SUCCPAYG identifies those cases who achieved E-4 
as measured by two created variables (PAYGEADE and N'JEYPAY). 
PAYGFADE was created from cne of two DMDC variables 
IPAYGEDE1 cr PAYGEDE3) that' measure an individual’s paygrade 
at the last file update or upon discharge from the service, 
as appropriate. NOEYPAY was created by converting an NKEC 
variable (flYPAYGED) from a categorical to a numeric vari- 
able. Using both DHIC and NHEC measures of paygrade ensured 
correct classification of an individual on this portion of 
the criterion. 

SOCCEEUP, the eligibility to reenlist portion cf 
the success criterion, was derived from the DMDC variable 
(ELGEEUPI) that captured the reenlistment code assigned upon 
an individual's discharge from the service. Service members 
cn active duty as cf the last master file update were 
considered eligible tc reenlist, as long as there was no 



2£iccjpiiiinant analysis requires the use of categorical 
vice continuous variables as classification variables. 
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ether irfcraatiCE tc the centrary. 
discuss how the inf crication ccDtained 
specifically evaluated using three 
analysis technigues. 



The next 
in the AD 
separate 



chapter will 
data set was 
statistical 



IV. 5IATIST1CAI ANALYS IS 

Iire€ distinct statistical methods were employed in this 
research: Descriptive Analysis, Regression Analysis and 
liscrinicant Analysis. All methods used Statistical 
Analysis System (SAS) procedures to analyze the data and 
develop the models. Table I contains a list cf the 46 
candicote predictor/discri ninating variables used in this 
study. In all, sir sets of variables emerged, and each set 
was aialyzed using bcth regression and discriminant techni- 
gues fcr comparison. These six sets of predictor/ 
discriminating variatles are shown in Table II, along with 
the appropriate criterion/classification variable. Each 
method, along with the results, are discussed in the 
following sections cf this chapter. It is worth noting 
that, while the results may not represent a marked improve- 
ment over tie selection process in use when individuals in 
the data set enlisted, the methodology presented may be 
applied to further analysis of the AD rating or to any otter 
rating in the Navy. 

A. EiSCEIfllVE ANAllEIS 

Descriptive analysis was accomplished through review of 
freguency distributions, summary statistics and multivariate 
corr elaticns. 

1. Ire quen c V Anal ysis 

Ireguency distributions are summary tables in which 
data are grouped or arranged into conveniently established 
numerically ordered classes or categories. The process of 
data analysis is, therefore, made much more manageable and 



TAB1£ I 

Candidate Pre cictor/Ciscriminating Variables 



Variatle 



Label 



AfC^PCKI 
UClGEfS 
iNIFYAGE 
EKliPi 1(G 
lEEMENlT 
ESCG 

lEPENEIS 

CHYEC 

EJSVAEGI 

SASVAENO 

EASliAEAD 

EASVAEliK 

SASVAEAE 

SAETiAEEP 

EASVAEEK 

SASVAEEI 

EASVAEMC 

SASVAEGS 

SASVAEEI 

SASVAE AI 

HACK 

CIEEB 

KOEEX 

AECCMPCS 

ACHJNECE 

INIEEO 1 

INIEE02 

INIEE03 

INIEECy 

INIEE05 

IKIEEC6 

1NIEEC7 



(CE EQUIVALENT) 

»c,'~ "■ ■' 



AFQT PEECENTIIE 

AEQT GFOUPS (5,4^, 43,4 A,3B, 3A, 2* 1) 
AGE Of INEIVIEUAL AT TIME OE ENTEY 
EKTEY lAYGEAEE (E0--O1 1) 

TEEM CE ENLISTMENT (NO. OE YEAES) 
HIGH-SCHOOL GEAEUATEM) V. OTHEE(O) 
SINGLE, NO DEPENDENTS (0) , CTHEEWISE 

:ae£ 



CCNVEiTEE NUflEEE CE YE 
SIANEAEDIZED SCCEE 
SIANDAEEIZED SCCEE 
SIANEAEEIZED SCCEE 
SIANDAEEIZED SCCEE 
SIANEAEEIZED SCCEE 
SIANDAEEIZED SCCEE 
SIANDAEEIZED SCCEE 
SIANEAEDIZED SCCEE 
SIANEAEEIZED SCCEE 
SIANDAEEIZED SCCEE 
SIANDAEEIZED SCCEE 
SIANEAEDIZED SCCEE 
1 

AE 
AE 



1 ) 



Hi 



BIACK, ELSE (0) 
NEITHEE BLACK IWE 
(01 FEMALE 
COMPOSITE 



ES OE EEUCATIOI 

- GENEEAL INFOEMATICN 
-NUMEEICAL CPEEAIICNS 

- ATTENTION TO DETAIL 

- HOED KNOHLEDGE 
-AEIIHMETIC EEASONING 

- SPACE PEECEPTION 

- MATH KNOWLEDGE 

- ELECTECNIC INFO 

- MECH COMPEEHENSICN 

- GENEEAL SCIENCE 

- SHOP INEOEMATION 
AUTO INFOEMATICN 



CAIE, 
AS VAB 
ASVAB 
DEPENEIS 
DEPENIIS 
DEPENEIS 
DEPENIIS 
DEPENEIS 
DEPENIIS 



WHITE, ELSE (0) 



HSDG * BLACK 



COMPCSITE 

* HSDG 

* BLACK 

* NISEX 

* TEEMENLT 

* SASVABAI 

* AEMINSCE 



SCEEEN 



INIEE08 


HSDG * 


NUSEX 


INIEECS 


HSDG =♦ 


TEEME 


INIEE1C 


HSDG =♦ 


SASVA 


INIEEl 1 


HSDG * 


ADMIN 


INIEE12 


BLACK * 


NUSE 


INIEE 13 


BLACK * 


lEEM 


1NIEE14 


BLACK * 


SASV 


INIEE15 


BLACK * 


ADMI 


1NIEE16 


NOSEX * 


TEEM 


INIEB17 


NDSEX =* 


SASV 


INIEE 18 


NDSEX * 


ADMI 


INIEE19 


TEEMEMI * S 


INIEE20 


lEEMEMT * A 


INIEB2 1 


SASVAEAI * A 


meanin of ul. 


In this s 


tudy. 


to provide counts and 


perc 



:CE 



frequency analysis wi s perfcrmed 



uals in the sample, and to illustrate tie range of the 
predictor and criterion variables. Tiis infcrmation 
provided a tase upon which tc screen aberrant observations 
and tc compare the results of this study. Frequency distri- 
tuticns are provided in Appendix B for the AD rating. 
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TABIE II 

SelecticD Models 



Model 


Predictors/ 
Discriminating 
V ariatles 


Regression 
Cr it erion 
V ariable 


Discc iminan 
Classif icati 
Variable 


A 


E2PENDTS 

ADMINSCR 

PIACK 

NOSEX 


TPRMENLI 

ESDG 

OTHER 


TAFMS 1 


SUCCTAP 


P 


TEfiMENLT 
INTER 04 
INTER 14 


IKIES03 

IKTER08 

I^TER21 


TAFMS 1 


SUCCIAP 


C 


INTER 03 
SASVAPWK 


IKTER08 

PATRPAYG 


TAFMS1 


sue CTAE 


I 


EEPENDTS 

CIHER 

SASVAEAI 


HSDG 

TPRMENLT 

SASVABWK 


SUCCESS2 


SUCCESS2 


P 


INTER03 


IMEfi09 


SUCCESS2 


SUCCESS2 


P 


INTER03 
INT ER21 
SASVAEEI 
SASVAESI 
CHYEC 


INIER09 

OTHER 

SASVABMK 

aecigrps 


SUCCESS2 


SUCCESS2 


Note: 


Variable s 


;ets A, B, 


E and E resulted from 



stepwise reqressiou techniques. The var- 
iable sets C and P resulted from stepwise 
discri linart techcigues. 
lable I provides the labels for these 
variables. 



2, Sum n ary Stati stics 



like freguen 
statistics are usefu 
tative data. These 
of location, dispe 
extract and summ 
Eepresentative summa 
data set are shown i 



cy dis 
1 for a 
summar 
rsion 
arise 
zy sta 
t Table 



tributions. 


descripti 


ve sunmary 


nalyzing and 


interpret 


irg guanti- 


y statistics 


represent 


proper ties 


and shape. 


and may 


be used to 


features of 


the 


data set. 


tistics for 


variables 


in the AD 


III. 
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TABLE III 

Selected Suamiary Statistics 



VABIAEIE 


MEAK 


STANDARD 

DEVIATION 


aiNIHOM 

VALUE 


cAXiaca 

VALUE 


AECafCNI 


43. 4 S 


20. 50 


6. 00 


99.00 


lAEMS 1 


49.22 


9.44 


2.00 


72. OC 


ENlBf AGE 


18. 8 5 


1.82 


17.00 


30.00 


SAfVAEGI 


49. 9 1 


7.71 


20. 00 


66.00 


SAEVAENC 


50.4 3 


7.80 


23. 00 


69.00 


SAEVAEAL 


50. 3 4 


9. 28 


20.00 


80.00 


S AEVAEWK 


48. 2C 


7.51 


30. 00 


64.00 


SAEVAEAE 


48. 9 2 


6. 98 


29. 00 


65.00 


S AEVAEEF 


49. 12 


8.39 


21.00 


66.00 


SAEVAECK 


50. 4 6 


7. 01 


26. 00 


67.00 


SAEVAEEI 


51.04 


6.98 


20.00 


68.00 


SAEVAEMC 


50. 08 


8. 24 


25. 00 


71.00 


SAEVAEGS 


49.51 


7. 14 


24.00 


70.00 


SAEVAEEI 


50. 9 C 


8.48 


20. 00 


65.00 


SAEVAEAI 


51. 16 


9. 51 


26. 00 


67.00 


CBIEC 


11.79 


0.73 


3. 50 


16.00 


AECCMECS 


1 99. 99 


1 9. 19 


99.00 


264.00 



tul tiv aria te Cor rel aticn An alysi s 



Measuring the strength of the relations hip tetween 
variatles nay be acccnplished ty correlation analysis. This 
technigue etables one to gain an idea of the degree of asso- 
ciation or covariation that a variable has with another 
variable. The summary measure that expresses the extent of 
this relationship is the coefficient of correlation, r, 
whose values range from -1 for perfect negative correlation 
to +1 for perfect positive correlation. Values close to 
zero indicate little systematic covariation between two 
variables. Correlation coefficients for quantitative vari- 
ables used in this study are contained in Appendix B. 

Assessing the strength of association between vari- 
ables does not allow a researcher to predict the value of 
one variable from the value of another variable. The latter 
involves regression techniques, and is presented in the next 
section of this stud^. 
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BEGfiESSION ANALYEIS 



Eegression analysis is cue method used to develc^: a 
statistical model that can predict the values of a dependent 
cr res£cns€ variable based on the values of independent or 
erplanatcry variables. Rather than merely measuring the 
association between variables with correlation analysis, a 
regression model attenpts to predict or explain- the value of 
the critericn variatle by developing an eguation that is 
based on weighted values of one cr more predictor variables. 

In developing the selection models in this study, the 
process employed was to first apply a variable "search" 
procedure called stepwise regression. The resultant models 
were ther analyzed by simple regression analysis, and vali- 
dated against a hcld-cut sample of the data set. The 
details cf this process, the specific models derived, and 
results of the analysis are reported in the following 
sections. Appendix C contains a discussion of regression 
analysis assumptions and methodology. 



^ • £te Fwi se Re gre ssion 



Cne of the desired characteristics of a regression 
model is parsimony, which means including the least number 
cf explanatory variables that permit adequate interpretation 
cf the dependent variable of interest. Such mcdels are 
easier to interpret ard are not as likely to be affected by 
nulticcllinearity3 problems. In developing the models for 
this study, stepwise regression procedures were employed to 
find a "test" combination cf predictor variables, thereby 
avoiding the computationally complex and costly process of 
examining all possible regressions. 



3tulticcllinearity refers to the condition in which some 
cf the independent variables are highly correlated with each 
ctber. When multicollinearity is present, the values of the 
regression coefficierts for the correlated variables may 
fluctuate dramatically. 
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In this stud^, two 
ahles were analyzed with th 
set included those entry-le 
were corsicered lihely t 
critericE, based on a revie 
studies. As discussed i 
included individual and dem 
ability# aaount of educati 
larital status# AFQT yercen 
provides a list of the 18 
data set that were used in 
Ihe second set of 
included the seven variable 
specified significance lev 
lEodel. In addition# this 
actions* of these seven var 
terms in this study repr 
previous enlistment standar 
analysis clearly indicate 
effects among predictor v 
variables and 21 interact 
analysis are also contained 
Csing these two se 
ables# tie stepwise proce 
criterion variables# 3AFMS1 
in Chapter III, The resu 
from a urifcrm random split 
observations in the flD dat 
constituted approximately 
the AE data set. Sc doin 



sets of candidate predictor vari- 
e stepwise procedure. The first 
vel attributes and measures that 
0 be good predictors of each 
w of similar enlistment standards 
n Chapter II# these variables 
cgraphic measures such as mental 
on# entry age# entry paygrade# 
tile# and ASVAB scores. Table IV 
candidate variables from the AD 
the stepwise procedure. 

candidate predictor variables 
s from the first set that met the 
el for inclusion in the stepwise 
set included all two-level inter- 
iables. Inclusion of interaction 
esents a marked departure ficm 
ds research. The results of this 
the presence of interaction 
ariables. The seven predictor 
ion terms used in the stepwise 
in Table IV, 

ts of candidate predictor vari- 
dure was run on each of the two 
and SUCCESS2# wnich were defined 
Iting four models were developed 
# the derivation sample# of 1iJ40 
a set. This derivation sample 
half of the 2820 total cases in 
g facilitated cross-validation of 



*An interaction involves .the product of two or more 
independent variables# and is included in a regression model 
when the relationship between one independent variable and 
the dependent variable changes for differing values of 
another independent variable £Be£. 11]. 
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TiBlE IV 

Predictcis in Stepwise Begressiocs 



Variable 



label 



iECIPCNI 
Etilf YIGZ 
ENIEIAKG 
3EEEEWIT 
ESEG 

LEPENCIS 

SASVIEGI 

SISVIEBO 

£ASVAE1D 

SASVAEWK 

EASVAESP 

SASVAEMC 

SaSVAEEI 

2ASVAEAI 

ElACK 

CIEEE 

KUSEI 

AEMINSCB 



- FIB£1 SET — 

- AFQT EEfiCENIIIE (CE EQUIVALENT) 

- AGE Ci INDIVIDUAL AT TIME OF ENTRY 

- ENTRY FAYGEADE JEO — Oil) 

- TEEM CE ENLISTMENT (NO. OF YEARS) 

- EIGH-SCHOOL GEADUATE(I) V. OTHER (0) 

- SINGLE, NO DEPENDENTS (0), OTHERWISE (1) 
STANIAEDIZEB SCORE - GENERAL INFORMATICS 
STANIARDIZED SCORE -NUMERICAL OPEEATICNS 

SCORE - ATTENTION TO DETAIl 
SCORE - 



STANDARDIZED 
STANDARDIZED 
STANDARDIZED 
STANDARDIZED 
STANDARDIZED 
STANDARDIZED 
(1 
[1 



(1 

AD 



EIACK, ELSE (0) 
NEITHER BLACK NOE 
tAIE, (0) FEMALE 
ASVAB compos; 



WORD KNOWLEDGE 
SCORE - SPACE PERCEPTION 
SCORE - MECH COMPEEHERSICN 
SCORE - SHOP INFORMATION 
SCORE - AUTO INFORMATION 

WHITE, ELSE (0) 



SITE SCREEN 





— - 


• SECCSD SET 


— 




TERMENIT 


— 


TEEM CF ENL 


ISTMENT { 


NO. OF 


ESDG 


— 


HIGH-SCHOOL 


GRADUATE 


M) V. 


DEPENDTS 


— 


SINGLE, NO 


DEPENDENT. 




SASVAEAI 


— 


STANDARDIZE 


D SCORE - 


EIACK 


— 


(1) EIACK, 


ELSE (0) 

C) FEMALE 




SUSEX 


— 


(1) MALE, ( 




ADMINSCR 


— 


AD ASVAB CO 


BPCSITE SCREEN 


INTEEO 1 


— 


DEPENDTS * 


HSDG 




ISTEE02 


— 


DEPENDTS * 


ELACK 




INTEEC3 


— 


DEPENDTS * 


NCSEX 




INTER04 


— 


DEPENDTS * 


TERMENLT 




INTERC5 


— 


DEPENDTS * 


SASVABAI 




ISTERC6 


— 


DEPENDTS * 


ADMINSCR 




INTER07 


— 


HSDG * BLAC 


K 




INTEE08 


— 


HSDG * NUSE 


X 




INTEEOS 


— 


HSDG * TEEM 


ENIT 




INIEE1 C 


— 


HSDG * SASV 


AEAI 




INTEE11 


— 


HSDG * ADMI 


NSCB 




INTER12 


— 


ELACK ♦ NUS 


EX 




INTER13 


— 


ELACK ♦ TER 


MENDT 




INTER14 


— 


ELACK ♦ SAS 


VAEAI 





YEARS) 

OTHER (0) 
OTHERWISE (1) 
INFORMATION 



INTEE15 - ELACK ♦ ADMINSCE 

INTER16 - NUSEY * TERMENDT 

INTEE17 - NUSE2 * SASVAEAI 

ISTERie - NUSEX * ADMINSCR 

INTER1S - lERMESLT * SASVAEAI 

IRIER2C - TEEMIMT * ADMINSCR 

INTER! 1 - SASVAEAI * ADMINSCE 



the nicdels against a told-out sample, the validation sample, 
whose characteristics would not influence the crigiral 
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t iat 



develcimeiit of the icdels. 
remalicd la the niod€l at 
procedure were significant 
fcere significant at £ < 
kere significant at £ < .00 



z . 



M ul tip le Re gress ion 



3he four models de 
were rest analyzed tsing 
describe the particular st 
the data. lable V contain 
Regression procedure that 
models. lor comparative p 
discriminant analysis tech 
section of this chapter, ar 
User * s Guide provides a det 
tics that are included in 
method of computation £Bef. 
E, with tie highest R-SQUAR 
significant, is the preferr 
lie proportion of 
able explained by the set o 
called tie coefficient of 
denoted R-SCUARE. Ihe va 
developed in this study ar 
partially attributable to 
in the AE data set. Howe 
variation of the criterion 
success as defined in this 
captured by the entry-level 
predictor variables. Ihes 
vidual's performance and de 
present themselves subseg 
include satisfaction kith 



The predictor variables 
the termination of the stepwise 
at £ < .10, and most variables 
05. The four models themselves 
Cl. 

veloped by the stepwise process 
the SAS Regression procedure to 
raight line model that test fit 
s the printed output from the SAS 
was run on each of the four 
urposes, two models developed by 
nigues, discussed in the next 
e also shown in table V. The S AS 
ailed description of the statis- 
the tables, as well as their 
12 j. It can be seen that Kodel 
R and all variables statistically 
ed regression model, 
variation in the criterion vari- 
f predictor variables selected is 
multiple determination, and is 
lues of R-SQUARE for the models 
€ relatively low. This may be 
the large number of observations 
ver, it is also likely that the 
variable, length of service or 
study, is also due to factors not 
attributes and measures used as 
e factors, which affect an indi- 
cision to remain in the service, 
uent to enlistment. They may 
initial assignment, geographical 



TIBII V 

Begression Analysis Results 



Model 


Predictors 


Paramete r 


Proh > JTJ E Square E Value 






Estimate s 




A 


IBTEECEPI 


29.049 


0.0001 0.0537 11.613 




CEPINDTS 


2. 841 


0.0636 




lEBMENlT 


3.639 


0.0001 




AEKINSCE 


-1.207 


0.0260 




ESEG 


1.807 


0.0036 




CIEEE 


2.254 


0.0294 




NCSEX 


4. 171 


0. 0079 




ElACK 


1,729 


0.0131 


B 


INTERCEPT 


32. 140 


0.0001 0. 0547 13.828 




lEECENlT 


3.890 


0.000 1 




IHTEE03 


15.724 


0.0026 




INTEE04 


-2,937 


0.0173 




INTEE08 


2.113 


0.0004 




INTEE14 


0.032 


0.0398 




INTEE21 


-0.024 


0.0134 


C 


INTERCEPT 


51. 746 


0.0001 0.0220 8.069 




INTEE03 


3. 888 


0.0163 




INIEE08 


2. 137 


0.0004 




EAE^ABHK 


-0. 101 


0.0022 




ENTfPAIG 


0.416 


0.3685 


D 


INTERCEPT 


0.535 


0.0002 0. 0255 6 .238 




EEPENDTS 


0. 172 


0.0131 




TEBMENLT 


0.053 


0.0549 




ESDG 


0.115 


0.000 1 




CTEEE 


C.080 


0.0871 




SASVABAI 


0.001 


0.5630 




SASVABWK 


-0.003 


0. 1028 


E 


INTERCEPT 


0.663 


0.0001 0. 0193 14.501 




INTEE03 


0. 196 


0.0064 




INTEE09 


0.030 


0.000 1 


F 


INTERCEPT 


0. 565 


0.0309 0. 0370 6. 1C7 




INTEE03 


0.20 2 


0.0053 




INTEE09 


0.038 


0.000 1 




INTER2 1 


-0.001 


0.0576 




CTEEE 


0. 101 


0.0297 




SA5VABEI 


0.006 


0.0022 




EAST/ ABSI 


0.002 


0.1456 




CHIEC 


-0.033 


0. 1 138 




AEQTGRPS 


-0.027 


0.0092 


Iccaticn of duty 


assignment , 


command climate, unit eaplcy- 


aent , 


change in , 


marital status, societal values and pres- 


sures 


, and educaticial and 


economic opportunities cutside 


the 


ailitary. 


These factors or measures are pest hoc 



consi der aticns that aie not availaile when screening candi- 
dates lor enlistment and initial rating assignment. They 



are issues that are appropriate for more sophisti cated iteth- 
cdolcgies# for example, covariance structure analysis which 
can treat complicated enlistment standards models as a 
series of simultaneocs eguaticns tnat capture perfcruance as 
a "multiple-stage" process occurring throughout an individu- 
al's flilitaiy career- [Bef. 10] 

• 3 - Val idation 

Ihe results of the regression procedure were next 
valicatec acainst the hold-cut sample. Each of the regres- 
sion icdels was derived from a uniform random sample, the 
derivaticr sample, of the observations in the AD data set- 
Ihe EAS Eegressicn procedure was employed to calculate the 
parameter estimates for the associated predictor variables 
using data from observations in this derivation sample. Ihe 
SAS Eccre procedure then used these estimates to predict the 
value cr the critericr variable for each cbserva ticn ir the 
validation sample. Einally, these predicted values were 
correlated with the actual values of the criterior in the 
validation sample. Ihese correlations represent the valida- 
tion coefficients for each model, and are shown in Table VI. 

TABU VI 

Begression Model Validities 



Model 


Eirst Validity 
Coefficient 


Second Validity 
Coefficient 


A verag 
Validi 


A 


0. 21342 


0. 20317 


0.21 


E 


0.21536 


0.21683 


0.22 


C 


0. 1445S 


0. 13612 


0.14 


c 


0. 17381 


0. 13766 


0.16 


E 


0. 1779C 


0- 12751 


0.14 


E 


0. 14430 


0. 13531 


0. 14 



Note: Ihe First Validity Coefficient is the result of 

the cross-validation, and the Second Validity 
Coefficient results from the double cross- 
validation. The reported average is the simple 
arithmetic mean. 
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As a furthex checJc cf the validity ot the six 
regressiCD models, the process was repeated hy deriving 
parameter estimates irom the validation sample, and using 
these estimates to correlate the actual and predicted values 
cf the criterion for chservaticns in the derivation sample. 
Ihis dcuhle cross-validation technique is described in 
detail ty Campbell £Bef<. 13]. Table VI also contains tnis 
second set cf validity coefficients for the six models. 

Occasionally , concern is expressed that random 
samples may not be from a homogeneous population, and, 
therefore, the sample correlations may differ from the popu- 
lation correlations. One method of addressing the problem 
of heterogereous samples is to average the correlation coef- 
ficients to obtain a single estimate of the population 
correlation. If the sample correlations are of about the 
same value and if they are not too large, as is the case 
with this study, this simple arithmetic mean will suffice. 
Kere this not the case, however, another technique is tc use 
transformations to Fisher's z coefficients. [Bef. 14] The 
simple arithmetic average correlations are also presented in 
Table VI. Appendix C contains the program used to calculate 
validity coefficients. 



C. DISCEIBIHAHT ANAIISIS 



The third statistical 
was discriminant analysis, 
allows cbservaticns to be c 
or categories on the basis 
As was done with regress 
models were derived ard ana 
of the data set, and teste 
cbservaticns. Variables 
using stepwise techniques. 



method employed in this research 
The use of discriminant analysis 
lassified into two or more groups 
of cne or more numeric variables, 
ion analysis, the discriminant 
lyzed from the derivation sample 
d against the hold-out sample of 
in the model were again selected 
The resulting two models, and 
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the fcur mcdels develcfed by regression analysis, were then 
analy 2 ed using the Sii£ Discriminant procedure. Ihe prcgram 
used in this analysis is contained in Appendix D, alcng with 
a discussion of discriminant analysis assumptions and 
methodology . 

1 - Ste rwis e Di sori mina nt A nalysi s 

Ihe SAS Stepwise Discriminant procedure was employed 
to select the most useful discriminating variables. It is a 
logical and efficient method of choosing an optimal ccmbira- 
tion of variables. Iheir selection to enter or leave the 
model is based on either the significance level of an F test 
or a squared partial correlation criterion. The selected 
variables are those which contribute most to the discrimira- 
tory power of the model. £Eef. 12] 

Ihe variables chosen by the stepwise discriminant 
process were selected from the 46 candidate variables shown 
previously in Table 1. The entry-level attributes and meas- 
ures that were considered likely to be good predictors, as 
discussed in Chapter II, represent 25 of these candidate 
variables. The other 21 variables are the two-level inter- 
action terms considered during regression analysis of the AD 
data set. The procedure was run on each of the two 
criterion variables, SOCCIAF and SDCCESS2, discussed in 
Chapter III. The criterion variables define the groups into 
which each observation will be classified by discriminant 
analysis, and are called classification variables. 

2 • Discriminant A nalys is 

As previously mentioned, discriminant analysis 
involves the study of differences between two or more 
groups, defined by a single nominal level variable, with a 
set of common discriminating variables. 
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3h€ SAS Discriniinan 
means for conductinc discr 
set. lie fxocedure ias run 
oped ty stepwise regress 
processes. Eaci ctservat 
which it has the soalles 
Also taken into account w 
group mentership. Jhese p 
freguency distribution of 
the sample data set. Ihis 
this study is attempting 
process in use at tie time 
latle VII contains 
ysis. Each procedure inco 
group mentership, indicate 
IfilOEE. Ihe classificati 
elements which depict the 
predicted (column) classif 
failure (0) groups. The f 
in the matrix are: 

(0,C) Ihe number cf fa 
failures 

(1,C) Ihe number of su 
be failures 
(0,1) Ihe number cf fa 
successf ul 

(1,1) Ihe number of su 
be successful 

Each section first certains 
eped by applying the classi 
tion sample. Tie second 
results of applying this 
ebsermatiers in the hcld-ou 
model . 



t Analysis procedure provided the 
iminant analysis cf the AD data 
on each of the six models devel- 
ion and stepwise discriminant 
ion is placed in the class from 
t generalized squared distance, 
ere the prior probabilities of 
notabilities are obtained from a 
actual successes and failures of 
was considered appropriate since 
to improve upon the selection 
the individuals enlisted, 
the results of discriminart anal- 
rperated the prior probability cf 
d cn the classification matrix as 
cn matrix is divided into four 
number of actual (row) versus 
ications into successful (1) cr 
cur elements (actual, predicted) 

ilure cases predicted tc be 

ccessful cases predicted tc 

ilure cases predicted tc he 

ccessful cases predicted tc 

the classification matrix devel- 
fication function to the deriva- 
classif ica tion matrix depicts the 
same classif icaticn function to 
t sample, thereby validating the 
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classif 


icaticns, 
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ur € cf 
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model 
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The 


sec 


end rate 


is 



%ier€ classiiied as (1,1) 
predicted as successful, 
and it provides a measure 
would iave performed. It 
selecticD strategy success 
is ar iupcrtart considerat 
will ie addressed further i 
analysis, fiodel fl is again 
the crly cne that iaproves 
existence during the timefr 
do illustrate how t 
example cf the classificat 
explained. The model corr 
as failures and 1079 cbserv 
these correct classificati 
total cf 1440 observations 
test the ncdel's accuracy, 
applied to the validation s 
matrix indicates 47 failure 
were correctly classified. 
79 percent cf the total of 
sample. dhe ccnsistency 
model is valid. The mode 
rate experienced by the Nav 
at the time the observation 
However, it is di 
upon such a high success 
attributes and measures mi 
success as defined in this 



ows two rates relevant to each 
e first rate is the percentage of 
called the "hit rate”, which 
e accuracy of the discriminant 
the percentage of enlistees who 
ccirpared to all cases whc were 
It is called the "success rate”, 
of how well this selection model 
may be compared to the original 
rate, the priors. Success rate 
ion with utility analysis, and 
n Chapter V. As with regression 
the preferred model since it is 
upon the selection strategy in 
ame of the AD data set. 
he results may be interpreted, an 
ion matrices for Model A will be 
ectly classified 49 observations 
aticns as successful. The sum of 
cns represents 73 percent cf the 
in the derivation sample. To 
the classif ica ticn function is 
ample. The second classif ication 
and 1039 successful observations 
The sum represents a hit-rate of 
1380 observations in the hold-cut 
of these hit-rates indicates the 
1 tetters the 85 percent success 
y with the selection process used 
s enlisted. 

fficult to significantly improve 
rate. Additional entry-level 
ght be found to better capture 
study. An alternate approach 
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would ie to redefine the success criterion- In either 
however, tie methodclogy presented in this chapter m 
siniilarlj fcllowed tc develop and test enlistment stan 
models. Ihe next chapter will discuss a method by whic 
utility cf such an effort may be measured- 



case, 
ay be 
dards 
h the 



3S 



TA£1£ VII 

Discriiioant Analysis Besults 



liodel friers 
C 1 



Classification Hatrir Hit Success 

Rate Rate 



A 



B 



0.15 0-85 



0. 15 0.85 



Iredicted 

SCCCIAF 





C 


1 


Total 


Actual 


0 49 


16 1 


210 


SUCCIAF 


1 151 


1079 


1230 


lotal 


200 


1240 


1440 



Predicted 

SDCCIAF 





0 


1 


Total 


Actual 


0 47 


136 


183 


SUCCIAF 


1 158 


1039 


1197 


Total 


2C5 


1 175 


1380 



Predicted 

SCCCIAF 







0 


1 


Total 


Ac t u a 1 
SUCCIAF 


0 


1 


209 


210 




1 


2 


1228 


1230 


lotal 




3 


1437 


1440 






Predicted 

SUCCIAF 








0 


1 


Total 


Actual 

SUCCIAF 


0 


0 


183 


183 




1 


1 


1 196 


1 1 97 


lotal 




1 


1379 


1380 



0. 78 



0.7S 



0 . 85 



0 . 87 



0.87 



0.88 



0.85 



0.87 
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Model 


friers 


Classification Matrix 


i it 


Success 




C 1 










Rate 


Rate 


C 


0.15 C.85 






Predicted 

SDCCIAF 




0.83 


0.86 










0 1 


Total 










Actual 

SUCClAf 


0 


15 195 


210 












1 


46 1184 


1230 










lotal 




61 1379 


1 440 














Predicted 

SCCCTAF 




0 .83 


0.87 










0 1 


Total 










Actual 

iOCClAf 


0 


8 175 


183 












1 


€2 1135 


1197 










Total 




70 1310 


1380 







D 0.23 0.77 




Predicted 

SDCCESS2 




3 .36 


0.86 






0 1 


Total 






Actual 

SUCCFSS2 


0 


302 35 


337 








1 


889 214 


1103 






Total 




1191 249 


1440 










Predicted 

SDCCESS2 




0 .35 


0-84 






0 1 


Total 






Actual 

SUCCISS2 


0 


277 4 1 


318 








1 


850 212 


1 062 






Total 




1127 253 


1380 







4 1 






Tiers 

1 



Classification I'latrix 



ait Success 
Rate Rate 



0.:3 0,11 



R redic ted 
S0CCESS2 



Total 



0.70 0.7S 



0.23 C.77 



Actual 

SUCCISS2 


0 


S5 


242 


337 




1 


187 


916 


1 103 


Total 




262 


1158 


1440 






f xedicted 
S0CCESS2 








C 


1 


Total 


Actual 


0 


1 12 


206 


318 


SUCCESS2 


1 


174 


888 


1062 


Total 




266 


1094 


1330 






f redicted 
SDCCESS2 








C 


1 


Total 


Actual 

SUCCESS2 


0 


238 


99 


337 




1 


554 


549 


1 103 


Total 




7S2 


648 


1 440 






f redicted 
sacCESS2 








C 


1 


Total 


Ac t u a 1 
S0CCISS2 


0 


2 18 


100 


313 




1 


5S1 


47 1 


1062 


Total 




809 


571 


1380 



D.72 0.7S 



0.55 C.85 



0.50 C.82 



Ti. OII-LITK AN ALYS IS 



Ibis chapter cortains an explanation of ta e applica- 
tilitj of utility aralysis to the development of selection 
procedures, and discusses the theory of utility analysis. 
3he methcdclogy used in this study to apply utility analysis 
is descrited, along iith sections on the calculation of cell 
profanities for regression and discriminant models, and a 
section on estimatinc cell utilities. More detail on the 
calculations and programs used for utility analysis may be 
found in Appendix E. 



A- ECBfCSE OF DULIIY ANALYSIS 



Ihe development of a model for use in predicting an 
applicant’s future performance in a particular jot is a very 
necessary part of most selection procedures. However, the 
model itself does not constitute enough informaticr to 
enable a decision to be made on whether or not it is worth 
implementing. The val idity of the model is one indicator of 
its potential usefulness but, as will be seen, other factors 
significantly affect the usefulness of a model. All organi- 
zations would find it valuable to be able to judge the worth 
of their strategy in guantitative terms, particularly when 
comparing their existing strategy to a newly developed, 
competing strategy. A framework is needed which will allow 
the evaluation of a selection model in terms of the institu- 
tional gains (or losses) that are expected to result when 
that model is used to guide decisions on selection. 
Classical utility analysis provides such a framework, ard it 
allows the calculaticr of usefulness to be made in terms of 
actual dcllars, which facilitates the comparison of one 
selecticr model with another. 
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lEICBY CF UTILITY AMALYSIS 



In the context cf utility analysis, there are fcur 
cutccnes c± interest associated with selection decisiors. 
These cutccnes are; 



* Positives (VP) , which refers to the numter 
cf applicants that are hired and who turn cut to 
he successful cn the jot. 

* 1j 2§S Pos iti v es (FP) , which refers to the numher 
cf applicants that are hired and who turn cut to 
te unsuccessful on the jot. 

• iJiSi Negatives (FN) are the people who wa re not 
hired, hut uic would have been successful if 
they had teen tired. 

• Valid Neg atives (VN ) are the people that were 
net hired, and who would have been unsuccessful 
if they had been hired. 

It is ct^icus frem tie terminclcgy and the explanaticns that 
VP and VK constitute correct selection decisions, and FP and 
FN represent selecticr error. 

These outcomes are perhaps easier to understand with the 
aid cf a diagram. Figure 5.1 shows the relationship between 
hypothetical predicted (fro n a model) and actual scores on a 
job perfcriance criterion fer a large number cf job 
applicants. 

The elJipse contains the data on predicted and actual 
criterion scores. In this diagrammatic example, the cerre- 
laticn between the predicted and actual scores (the mcdel's 
validity) is apparent — higher predicted scores are associ- 
ated kith higher actual scores and vice versa. The pcint _y 
cn the vertical axis is the dividing line between what is 
considered to be successful performance (say completicn cf 
^8 months cf service for first term enlistees), and unsuc- 
cessful performance (less than 48 months service before 
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iCICAl 




reject 



accept 



SCORE 



Fignre 5.1 Hypotietical Predicted and Actual Scores 

disciarge) . In utility analysis the term base rate is 
defined as that prcpcrtion of current employees whc are 
considered to be successful. If seven out of everj ten 
emplcjees are successful, then the base rate is .70. Ihe 
point X on the horizontal axis is referred to as the cu t 
score . If an applicant’s predicted score (from the mcdel) 
is greater than x, then that person will be accepted 
(hired), ard if their predicted score is less than x, then 
they will be rejected (not hired) . The location of x cn the 
horizcntal axis will often depend on the selection ratio, 
which is the proportion cf applicants that need tc be 
accepted in order tc fill a certain number of jobs. If, 



ever the course cf ere year, 80 job vacancies are expected 
to occur and if 100 axplicacts over the year are expected to 
apply fci those jobs, then the selection ratio needs to be 
.80 if all vacancies are to be filled. In the happy circum- 
stance (from the recruiter's point of view) where there are 
far acre applicants than jots, then the cut score x will be 
chosen sc as to aaxinize the ut ilit v cf the selection proce- 
dure. 'Utility is defined here to mean the expected gain in 
dollars that results from a particular selection strategy. 

lie lines generated from the base rate and the cut score 
divide the sample into four cells as shown. Each cell 
contains the people who are classified into each of the four 
cutcemes cf interest. In cells 1 and 2 are people whose 
predicted score is higher than the cut score. Therefore 
these people would be classified as accept. These accepted 
people (the positives) are further divided into those who 
would be successful (valid positives) and those who would be 
unsuccessf ul (false positives) . Cells 3 and 4 contain the 
people whe scored lower than the cut score on the predictor, 
and these would be classified as reject. Again, seme of 
these rejected cases would have been successful (false 
negatives), and some would have failed (valid negatives). 
In utility analysis it is convenient to convert the cell 
counts (represented by VP, IP, IN and VN) to propertiens of 
the overall sample, so each count is divided by the number 
cf people in the sample and the cell probabilities (PVP, 
PIP, PIN and PVN respectively) result. 

Cne further result of interest is the s uc cess rate. The 
success rate is defined as the proportion of hired appli- 
cants whe are, or will be, successful. It is simply feurd by 
dividing PVP by the sum of PVP and PFP. 

Given the concepts and terminology outlined above, it is 
now possible to discuss in general terms the factors that 
will affect the cell probabilities which, in turn, affect 
the expected utility. 
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1. Jo del Valid! t y 



Ihe model's validity, as measured ty the correlation 
tetween predicted and actual scores, is one factor that 
deternines the degree of selection error resulting from the 
selecticr strategy. If the validity is high, then the 
proportions of oorrectly classified people (PVP and PVN) 
kill te higher, and the selection error {PFP and PFN) will 
he lower. Vineberg and Joyner in their review of almost 150 
military studies related to jot performance prediction, 
found that validities range from .15 to .40, from a total of 
350 validity coefficients £Bef. 15]. Generally, validities 
withir this range wocld te considered as low or medium. 

2. Ease Rate 



If the existing base 
greater), then it means that 
currently in use has a high r 
potentially successful appli 
stances, it is unliJcely tha 
staffing process would yield 
correctly selecting ajplicarts 
the cell probabilities for P 
higher than for PFP ard PVN. 



rate is high (say 
whatever selection strat 
ate of success in ident 
cants. Under these c 
t using a new model i 
much of an improveme 
. A high base rate mean 
VP and PFN are going 
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3 . ■ Select ion Rat io 



Assuming the model 
ratio, the more useful t 
successful applicants. D 
that the organization can b 
it hires. Naturally, it w 
score highest on the predic 
most likely to be successfu 
cut score) will mear that 



is valid, the lower the selection 
he model will be in identifying 
ecreasing selection ratios mean 
e increasingly selective in whom 
ill tend to accept only these who 
tor, those who are also predicted 
1. A low selection ratio (high 
PVP and PFP will be small. It 
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also fellows that a lew selection ratio will yield a higher 
success rate — although lew people will be hired, most of 
them will represent correct selection decisions (PVP) . 

C, iSaieillNG THE OIIIITY CP 1 HCDEl 

Ihe expected utility (EU) cf a model is found by summing 
the picdccts of each cell probability and its associated 
cell utility (Ul, 02, 03 and 04), and subtracting the cost 
cf giving tie test tc an applicant (OT) . 

EO = CliPT/P) + 02 (PIP) C3 (PFN) + 04(PVN) - OT (5.1) 

Apperdix E contains detailed descriptiors on how cell 
probabilities and cell utilities are determined. Per a 
discriminant model the cell p icbabilities may be readily 
derived from the output of the SAS Discriminant procedure, 
because the model classifies cases into predicted successes 
and predicted failures. In the regression model the cut 
score is not Xnewn ir advance, so cell probabilities that 
result from a number cf possible cut scores are calculated, 
and a cut score is eventually chosen based on which set of 
cell p icbabilities maximizes the utility of the model. 

The formula for calculating the expected utility of a 
model recuiies that a utility be assigned to each selection 
outcome. These cell utilities are designated U1 through U4 
and are associated vith the outcomes VP, FP, FN and VN 
respectively. The Billet Cost Model provides an estimate of 
the cost tc the Navy cf staffing a billet. In this study it 
is assumed that this cost is egual to the marginal product 
cf a successful sailer, and so the utility of a valid posi- 
tive (Cl) is assigned a value of $24,163 [Bef. 16]. No 
proven technigue exists for estimating the cell utilities 
for the three other selection outcomes. Individual 
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circuDstances and fievailing market conditions make it 
difficult tc estimate these outcomes with real confidence, 
so these cell utilities were estimated relative to U1, and a 
miner form cf sensitivity analysis was conducted. Ihe cell 
utility cf a false pcsitive (U2) was assigned values cf -.5, 
-1 and - 2 . Valid negatives (D4) were assigned an egual and 
opposite utility to U2, and false negatives (U3) were 
assigned values cf 0, -.25 and -.5. Table VIII shows seven 

different sets of cell utilities that were considered. 

TAEIE VIII 

Belative Cell Otilities 



U1 


U2 


U3 


U4 


1. 0 


-0.5 


-0.25 


0. 5 


1. 0 


-0.5 


0.0 


0. 5 


1. 0 


-1.0 


0.0 


1.0 


1. 0 


-2.0 


0.0 


2. 0 


1. 0 


-0.5 


-0.5 


0. 5 


1. 0 


-1.0 


-0.5 


1. 0 


1- 0 


-2.0 


-0.5 


2. 0 



Tie cest of administering a test (UT) is of significance 
if the costs of testing are different for competing selec- 
tion strategies. The models developed in this study tse 
data cattered from the existing tests, and therefore the 
costs cf testing will remain much the same. Thus in this 
context, OT may be ignored since it applies equally tc the 
cld and rew tests. 

I. BIIDIIS OF OTIlIll ANAIISIS 

Tables IX and X certain the results of the utility anal- 
ysis ct the regressicr and discriminant models respectively. 
The "Percent Change in EU" column is the result cf the 
comparison cf the model's utility with the utility cf the 
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Navy's ciiginal selection strategy (base line utility). A 
positive percentage change in ZU indicates that the aaxittum 
utility cltainable ficm the ncdel is higher than the utility 
cf the original selection strategy. An increase in utility 
of say $50 iieans that the Navy saves $50 ror each selection 
decisicr (correct or incorrect) that is made by using the 
model rather than the original strategy. For the models 
kith the 5DCCTAF or 1AFMS1 criterion, the base rate is .861, 
i.e., 66.1 percent ci the people selected by the Navy were 
successful. These people can be thought of as the valid 
positives cf the original strategy and the remaining 13.9 
percent are false positives. (For tne SUCCESS2 criterion 
these figures are 76.8 percent and 23.3 percent.) 
Unfortunately it is net possible to calculate the values of 
false and valid negatives so these are considered tc be 
zero. For the IAFMF1 or SUCCTAf criterion then, the cell 
probalilities for the original selection strategy are £V£ = 
.861, PF£ = .139, PFK = 0 and PVN =0- The base line utility 
for each cf the three differert combinations of 01 and U2 
can then be calculated. The model utilities are then 
compared tc these base line utilities and the differences, 
expressed as a percentage of the base line utilities, are 
reported. Similarly the base success rate cf the original 
strategy is also .861 (for the TAFMS1 or SUCCTAF criterion). 
The cclujin "Change in Succrate" reports the actual differ- 
ence between the models' success rates and the base success 
rates. The column "SBATIO" shows the selection ratio that 
results when the cut score is chosen so as to maximize the 
utility, for each set of cell utilities. 
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Fearessicn Models 



For most sets of cell utilities, the regression 
models developed shew little improvement over the original 
selecticr strategy. In most cases the selection ratio is 
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very close to 1 and the percentage increase in expected 
utility is very small. Inis is not a surprising result 
because the model validities are relatively low (around .20) 
and, more significartly, the tase rates are very high at 
.861 and .768. It is interesting to note, however, that 
when the ccsts of a false positive and the benefits of a 
valid negative are high, then the selection ratio is driven 
down, and the utility and success rate go up. 



Eiscriminant bedels 



In general the discriminant models did net perform 
as well as the regression models or the Navy* s original 
selectici strategy. For seme models the percent change in 
EU was a significant positive number, but these were usually 
associated with extreme assumptions of cell utilities. In 
additicr to the factors mentioned in the previous subsec- 
tion, this poor performance is because the discriminant 
models laeJe the flexibility to vary the cell probabilities 
depending on the values of the cell utilities. Ihers is no 
eptien tc vary predictions depending on the conseguencss of 
correct and incorrect selection decisions, and thus only cne 
set of cell probabilities is available for each discriminant 
model. 
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TABIE IX 





Utility 


Eesults 


- Eegression Models 






MCEEI 


£1 


U2 


U3 


04 


EU ASDCCRATE 


SFATIC 


A 


1.0 


-0.5 


-0.25 


0.5 


0.12 


0.001 




C.SS8 




1,0 


-0.5 


C 


0.5 


0. 14 


0. 001 




C.SS8 




1.0 


- 1. 0 


0 


1.0 


0.34 


0.001 




C. SS8 




1.0 


-2.0 


0 


2. 0 


5.85 


0.022 




0.8C5 




1.0 


-0. 5 


-C.5 


0.5 


0.11 


0. 001 




C.SS8 




1.0 


-1.0 


-0.5 


1. 0 


0.32 


0. 001 




C.SS8 




1.0 


-2. 0 


-0.5 


2. 0 


1.25 


0. 003 




C.9S8 


E 


1.0 


-0. 5 


-0.25 


0. 5 


0.0 


0.0 




1.0 




1,0 


-0. 5 


C 


0.5 


0.0 


0.0 




1.0 




1 .0 


- 1. 0 


0 


1. C 


0.0 


0.0 




1.0 




1 .0 


-2. 0 


0 


2.0 


6.28 


0.023 




C.8 10 




1.0 


-0.5 


-0.5 


0. 5 


0.0 


0- 0 




1.0 




1.0 


-1.0 


-0.5 


1.0 


0.0 


0-0 




1.0 




1.0 


-2. 0 


-0.5 


2. 0 


0.28 


0. 002 




C.S85 


C 


1.0 


-0.5 


-C.25 


0.5 


0.0 


0.0 




1.0 




1.0 


-0.5 


0 


0. 5 


0.0 


0. 0 




1.0 




1.0 


-1.0 


0 


1.0 


0.05 


0. 001 




c.sss 




1.0 


-2. 0 


0 


2. 0 


5.79 


0.016 




C, £71 




1.0 


-0.5 


-0.5 


0.5 


0.0 


0.0 




1.0 




1.0 


-1.0 


-0.5 


1.0 


0.0 


0- 0 




1.0 




1.0 


-2. 0 


-0.5 


2. C 


0.4 


0.004 




C.972 


£ 


1.0 


-0.5 


-0.25 


0.5 


0.15 


0. 002 




C- SS5 




1.0 


-0. 5 


C 


0. 5 


0.22 


0-002 




0-9S5 




1 .0 


-1.0 


0 


1.0 


5-1 


0. 027 




C.861 




1.0 


-2. 0 


0 


2. 0 


72.98 


0. 074 




0.2£3 




1.0 


-0.5 


-0.5 


0.5 


0.08 


0. 002 




C. S£5 




1.0 


- 1, 0 


-0.5 


1.0 


0-76 


0- 002 




0.995 




1.0 


-2.0 


-0.5 


2.0 


35.44 


0.014 




C. 6C6 


E 


1.0 


-C. 5 


-C.25 


0.5 


0.0 


0.0 




1.0 




1.0 


-0.5 


0 


0.5 


0.0 


0.0 




1.0 




1 .0 


- 1. 0 


0 


1.0 


3.5 1 


0.033 




0.799 




1.0 


-2. 0 


0 


2. 0 


61.76 


0. 124 




0.C56 




1.0 


-0. 5 


-C.5 


0.5 


0.0 


0.0 




1.0 




1.0 


- 1. 0 


-0.5 


1. 0 


0.0 


0. 0 




1.0 




1.0 


-2. 0 


-0.5 


2. 0 


33.51 


0.033 




0.799 


E 


1.0 


-0. 5 


-0.25 


0,5 


0. 1 4 


0.001 




C.997 




1.0 


-C. 5 


C 


0. 5 


0.16 


0.001 




0.997 




1.0 


-1.0 


0 


1.0 


4.77 


0- 013 




C. £ 16 




1.0 


-2. 0 


0 


2.0 


79-18 


0. 063 




0.5 11 




1 .0 


-0.5 


-0.5 


0.5 


0.11 


0.001 




C.997 




1.0 


- 1. 0 


-0.5 


1.0 


0.46 


0.001 




C.997 




1.0 


-2.0 


-0.5 


2. 0 


36.61 


0. 014 




0- £C7 


Note: 


I he base i 


11 tilitie 


s f cr 


Models k. 


E and 


C 


are - 




$191 12 (wheu U2 i 


s -0. 


5). $17428 


(when 


U2 


is 




-1.0) 


and 


$14061 


(when 


uz is -2) , 


and t he 


tase 




success rate is 0 


.861. 












lie base i 


1 tilitie 


s f or 


Models D. 


£ and 


F 


are - 




$ 15744 (wh€D 02 i 


s -0. 


5), $ 12938 


(when 


U2 


is 




-1.0) 


and 


$7326 ( 


when 


U2 is -2.0) 


, and 


the tase 




success rate is 0 


.768. 
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TABIZ X 



Dtility B€£ults - Discriminant Models 



MCIEl 


Cl 


U2 


U3 


C4 


A% EU 


A SUCCRAr E 


SBAIIC 


£ 


1.0 


-0.5 


-0.25 


0.5 


-13. 0 


0.016 


0.856 




1.0 


-0. 5 


C 


0. 5 


-9. 5 








1 .0 


- 1. 0 


0 


1. 0 


-5. 8 








1.0 


-2. 0 


0 


2. 0 


4. 6 








1.0 


-0. 5 


-C.5 


0.5 


-16.5 








1.0 


- 1. 0 


-0. 5 


1. 0 


-13. 3 








1.0 


-2. 0 


-0.5 


2. 0 


-4. 8 






£ 


1.0 


-0. 5 


-0.25 


0.5 


-0. 1 


0. 0 


0. 9S9 




1.0 


-0. 5 


0 


0. 5 


-0. 1 








1.0 


- 1. 0 


0 


1, C 


-0. 1 








1.0 


-2. 0 


0 


2. 0 


0. 1 








1.0 


-0. 5 


-C.5 


0. 5 


-0. 2 








1.0 


-1. 0 


-0.5 


1.0 


-0. 1 








1.0 


-2. 0 


-0.5 


2. 0 


0. 0 






C 


1.0 


-0.5 


-0.25 


0. 5 


-5.0 


0.074 


C.954 




1.0 


-C. 5 


- C 


0.5 


-3.8 








1.0 


- 1. 0 


0 


1. 0 


-3. 1 








1.0 


-2. 0 


0 


2. 0 


-1.0 








1.0 


-0. 5 


-C.5 


0. 5 


-6. 2 








1.0 


- 1. 0 


-0.5 


1. C 


-5.7 








1.0 


-2. 0 


-0.5 


2. 0 


-4. 3 






£ 


1.0 


-C. 5 


-0.25 


0. 5 


-86.8 


0.081 


0. 178 




1.0 


-0.5 


0 


0. 5 


-63. 1 








1.0 


- 1. 0 


0 


1.0 


-38. 5 








1.0 


-2. 0 


0 


2. 0 


67. 5 








1.0 


-0. 5 


-C.5 


0. 5 


-110.0 








1.0 


-1. 0 


-0.5 


1. 0 


-96. 8 








1.0 


-2. 0 


-0.5 


2. 0 


-25.3 






£ 


1.0 


-0.5 


-C.25 


0. 5 


-13.3 


0. 033 


C.7SS 




1.0 


-0. 5 


C 


0. 5 


-8. 4 








1.0 


-1. 0 


0 


1. 0 


3.5 








1.0 


-2. 0 


0 


2. 0 


54.6 








1.0 


— 0- 5 


-0.5 


0. 5 


-18.2 








1.0 


- 1. 0 


-0.5 


1.0 


-8. 4 








1.0 


-2.0 


-0.5 


2. 0 


33. 5 






£ 


1.0 


-C. 5 


-C.25 


0. 5 


-53.1 


0. 069 


0. 432 




1.0 


-0. 5 


C 


0. 5 


-37.5 








1.0 


- 1.0 


0 


1.0 


-15.4 








1.0 


-2. 0 


0 


2.0 


79. 4 








1.0 


-C.5 


-C.5 


0- 5 


-68. 7 








1.0 


- 1. 0 


-0.5 


1- 0 


-53.3 








1.0 


-2. 0 


-0.5 


2. 0 


12. 5 







Ncte: lie iase utilities for Models A, B and C are - 

$19112 (whei 02 is -0-5). $ 17428 (when 02 is 

-1-0) and $ 14061 (when 02 is - 2 ), and the tase 
success rate is 0-861- 

Ihe base utilities fcr Models D. E and F are - 

$ 15744 (wher 02 is -0-51, $ 12938 (when 02 is 

-1-0) and $7326 (when 02 is -2-0), and the rase 
success rate is 0 -768- 
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VI. C0H CIC5I CHS AHE BECOMMEMDA TIO MS 

liis study set cut to pxcvide a method foe develcping 
eclistaert standards models which improves upon sinilar 
processes presently ir use. Toward that end, significant 
advances have been made, particularly when compared tc prior 
studies conducted at the Naval Postgraduate School. The 
technigues used provide a much more comprehensive approach 
to model development. They employ regression analysis to 
fully develop the stepwise regression results. In addition, 
stepwise discriminant procedures were used to find an 
optimal model prior to full discriminant analysis. 
Alternative criteria for measuring successful operational 
performance, including a continuous length of service 
criterion, were incorporated in the models. Finally, each 
model was analyzed using toth regression and discriminant 
analysis teclmigues. 

Perhaps most significant is the presentation of a means 
ly which tie benefits from such efforts may be gauged. The 
development of innovative utility analysis programs affords 
future researchers ar excellent opportunity to measure in 
monetary terms the benefits tc be derived from implementing 
a new selection strategy. It is important to reiterate that 
the statistical and utility analysis technigues presented in 
this study may be easily applied or modified to accommodate 
selecticn standards model development for any of the mere 
than SO Navy ratings contained in the master data base. 

A secondary purpose of this study was tc discover 
whether the models developed improve upon existing selection 
and assignment strategy for the AD rating. By and large, 
the models presented do not appreciably enhance the 
processes used since 1976. The models do, however, allow 
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cue tc fccus on some specific considerations in the current 
screening processes. For exaiple. Models A, B, and C allow 
polic;j naJcers to consider length of service in months, and 
to vary the criterion for measuring success. This capa- 
tility is particularly appropriate for use in a dynamic 
recruiting market. 

A. EI5DIIS 

Ihis study yielded several ether results worth noting. 
Ihe term cf enlistment variable may be used tc predict 
success now that it has been corrected to reflect a ctive 
duty obligation. Ibis is particularly important when 
assessing Haval Eesexvists, whose six year contract gener- 
ally reguires only three years of active service. Ihe 
change frcm a negative to a significantly positive ccrrela- 
tion cf HEMENLT on the criteria is one of the mere impor- 
tant discoveries of this research effort. 

Ihis study also determined that the usefulness of the 
BCREEK cempesite score in predicting gob performance meas- 
ures was virtually nonexistent. It appears tc be mere 
appropriate to use the SCREEN score componen t s in the 
models, at least when attempting to predict oper at icn al job 
performance. Ncn traditional ASVAB subtest scores, such as 
Auto infermation, may alsc be appropriate for use in the 
screening process. Another significant finding cf this 
study is the definite presence of interaction effects. 
Considering personal measures on an individual ir ccnjunc- 
tion with ether measures represents a marked change in 
current selection practices. 

Ic summarize the results of the statistical analysis, 
the variables measuring term of enlistment, education, 
dependents status, sex and race emerged as repeatedly 
significant predictors cf successful operational 
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perfcinaxce. The ccnposite aeasure of eligibility for the 
AD rating, and the ASVAB Auto Information subtest score, 
were alsc significant predictor variables. In additicn, 
Eodel E was shown to be the best regression and discr iminant 
model . 

Ihe results of tie application of utility analysis stow 
that the regression models developed in this study perform 
as well as or better than the original Navy strategy which 
was used as the comparison (base line utility). It is 
important to note however, that the methodology used in this 
part of the study ensured that regression models will 
provide a maximum utility at least egual to the base line 
utility. Ihis is because the technique allows the cut sccre 
to be set sc low that all cases are accepted. Models A and 
I are ccrsicered to be the best of the models because they 
provide for significant increases in utility without having 
to resort to impractical ly low selection ratios. Ihe 
discriminant Models A and E are better than the others 
because improvement over base utility is possible, depending 
cn the cell utilities. 

As was mentioned in Chapter V, the high existing base 
rates are an indication that newly developed models are 
unlikely to produce superior results. Utility analysis is 
hindered by the difficulty of confidently estimating the 
individual cell utilities, and this is an area that is in 
need cf further research. It is also difficult to compare 
new selection strategies to existing ones because it is 
impossible to classify the cases rejected by the existing 
strategy as valid or false negatives. Data of this scrt can 
only be obtained bj testing all applicants and then 
accepting all of them, regardless of their relatiorship to 
the cut sccre, or to the desired selection ratio. 
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E, EECCEiJEMDAlICNS 

C€£pit€ the advaices made ty this study, there renaias 
oany cyycrtmiities tc refine the models presented for the AD 
rating, and to develop models for ether Navy ratines, 
frocedurally, these opportunities include testing for curvi- 
linearity of the models, expanding the interaction terms to 
three cr more levels, and seeking different combinations of 
ASVAE suttest scores as potential predictors. Ihere may 
also be ether measures not evaluated by this study that are 
significant operational performance predictors, such as 
enlistment vaivers, lEP status, or involvement with civil 
authorities . 

Consideration should also be given to altering the 
criterion variables. One particularly promising adjustment 
may he tc change the criterion to reflect achieving E-5. 
Ihis may be appropriate since the models developed appear to 
do a letter job of predicting longer LOS, as indicated by 
preliminary residual analysis. Developing separate .models 
that yield predictions of shorter LOS may also be in erder. 

Ihe multiple-stage analytic approach referred tc in 
Chapter IV also appears to be a promising technique. Such 
analysis might consider change in dependent status, perform- 
ance evaluations, or advancement exam results as variables 
in a mcdel. 

3c improve the usefulness of utility analysis it is 
important that a technique he developed to estimate cell 
utilities with reasonable accuracy. Such a technique needs 
to he able to control for changes in the recruiting market, 
and he sensitive tc the changing Navy requirements for 
recruits- It is also important that data be gathered on 
applicants who are net accepted into a particular rating, tc 
allow researchers to determine if they were reclassified to 
another rating, or rejected entirely. 
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Ie ccDcIusion, it is clear that continued efforts to 
develcf selection standards models for all ratings are 
essential. For it is through these efforts that the cost cf 
training ard maintaining Navy personnel will be reduced, 
Ihe resultant experienced career force will ensure the Navy 
is ready tc meet any global commitment. 
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APPENDIX A 

DATA EASE DEiElCPMENT PROGBAMS 



This apiendix picvides the SAS prograas used in this 
studj to access the naster data base, develop the AC data 
set, anc create new predictor and criterion variables, as 
discussed ir Chapter III. Each program contains the job 
centred language information appropriate to tne Naval 
Postgraduate School's IBS 3033 computer system. Statistical 
Analysis System (SAS) statements are employed it the 
programs to accemplish the desired functions. These SAS 
statements are normally preceded by comments to explain 
their purpose, the ccaments being identified by an asterisk. 

Table XI cortairs the pregram called "ADSETUP". This 
program was used to access the master file and extract 
informaticr on Aviation Machinist's Mates (AD). (The master 
file tape, originally called "ENLIST", has recently been 
revised and relabeled "NPS7 09"-) The data file created by 
this program is called "ADDATA", and it contains the iritial 
243 variables from the master file. Also provided in the 
program are the variable names and labels. The program may 
be used to extract data from the master file for any ox 
approximately 90 Navy ratings simply by entering the appro- 
priate abbreviation and four digit code for the selected 
ratine. 



Table XII provi 
was used to screen t 
These screens were p 
file, and the re 
"ADSDESET". Becaus 
ables in the data, 
available. Therefo 



des the 
he data 
erf orme 
suits 
€ of th 
suf f ici 
re, th 



program called "ADSCREIN" that 
extracted from the master file, 
d on observations in the "AEEATA" 
were placed in a file called 
e large number of cases and vari- 
ent computing work space was not 
e SAS KEEP statement was used to 
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retain 116 cf the iritial variables for analysis. It was 
felt these 116 variables captured all the desired neasures 
CD the cbservations that would be required for analysis. 
The last screen was incorporated following frequency distri- 
bution analysis to reaove cases that had aberrant or inpcs- 
sible data associated with then. 

lable XIII contains the program called "ADNEWVAS”. Ibis 
program was employed to create new predictor and criterion 
variables, as discussed in Chapter III. The prcgian used 
information on observations in the "ADSOBSET" data file to 
create tie new variables, and placed the results cf these 
cperaticrs in a file called ••ADALL4". This file thus 
constitutes the AD data set referred to throughout this 
study. It contains all of the selected and created vari- 
ables that provide information on the 2820 ADs who remained 
in the data set after all screens were accomplished. It is 
this file that was used to conduct the statistical analysis 
for this study. 

The "AENEWVAE'’ program lists all created variable names 
and labels. It also contains the SAS statements that 
converted several qualitative variables to numeric variables 
cr dichotcicus (0,1) variables. finally, the program shews 
the SA5 statement used to split the AD data set into the two 
unifcrmly distributed random samples (BANDA1L1). These 
derivation and validation samples were used during regres- 
sion and discriminant model development described in Chapter 
IV. 



60 



TABU XI 

frogram to Eatract Bata from the Master File 



(2807,0 110) 
28C7& 



•D OSIUND, SMC 1763' ,CLASS=K 



//ADIAIA JCB _ 

//*MA3N C£C=N^GVfJ1 
y / F X I C S A S 

y/SAS-HCfK ED SP ACE= JCIL, ( 1 2 , 4) ) 

//FIII3N EE UNII=34CC-5,V01=SE£=EN1IST, 
yy EI£P=C1E,DSN=EN1S'1.ALL, A7678 

y/FIlECOI EE UNI1=3320V,MSVGF=PUB4B,DISP= (NEW,CAT1G,EE1FIE) , 
y/ ESE=MSS.S28(7.ADDA1A, 

y/ £CB= {ELKS12Z=640 0) 

y/SYSIN EE * 

CPTICliS 1£=80 NOCENIIF; 

EATA FIlECd.ADDATA; 

* THI£ EfCGEAM EXTEACIS NEAEIY All THE VARIABIES FEOE TEE 
MASIEE EIIE, AND HEITES GOT A FIIE TO MASS STORAGE 
WHICH CCNIAINS All THESE VARIABIES FOR All CASES WHICH 
HAE ANYTHING TC DO WITH THE 'AE' RATING.; 



INFIIE 

INEOI 



FIl EIN: 



3 


5 


CENSOSRG 


PIE1. 2 


6 


CENSCSDS 


PIB1. 2 


7 


HOMEZIP 


PIB3. 


a 


10 


EtlSTATE 


PIE1. a 


1 1 


DAIEEETY 


PIB1. 2 


12 


lATEDETM 


PIB 1. 


a 


13 


EIBTKYfi 


PIE1. 2 


14 


BIRTHMTH 


PIB1. 2 


15 


BIRTHDAY 


PIB 1. 


a 


16 


ENIf YAGE 


PIE1, 2 


17 


RECCEEID 


PIB1. 2 


18 


HYEC 


PIEl. 


S 


19 


SEX 


PIB1. 2 


20 


RACE 


PIB1. 2 


21 


ETHNIC 


PIS 1. 


a 


22 


RACEEIEN 


PIE1. 2 


23 


MRIIDPND 


PIB1. 2 


24 


TESTFCEM 


PIB 1. 


a 


25 


AFCTPCNI 


PIEl. 2 


26 


AFQIGEPS 


PIB1. 2 


27 


ASVAEGI 


PIBl. 


2 


28 


ASVAENC 


PIE1. 2 


29 


AS VAEAD 


PIB1. 2 


30 


ASVABWK 


PIBl. 


a 


31 


ASVAEAR 


PIE1.' 2 


32 


ASVABSP 


PI31. 2 


33 


ASVABMK 


PIEl. 


a 


34 


ASVABEI 


PIE1. 2 


35 


AS VAEMC 


PIB1. 2 


36 


ASVABGS 


PIE 1. 


2 


37 


ASVABSI 


PIE1. 2 


38 


AS VAEAI 


PIB1. 2 


39 


SERVACCS 


PIBl. 


a 


40 


IRICRSRV 


PIE1. 2 


41 


POl 


PIB1. 2 


42 


HES 


PIE 1. 


a 


43 


ASVABCM 


PIE1. 2 


44 


AS VAECA 


PIB1. 2 


45 


ASVAECE 


PIB 1. 


2 


46 


ASVABCC 


PIE1. 2 


47 


ENTEYSTA 


PIBl. 2 


48 


HEIGHT 


PIEl. 


2 


49 


HEIGHT 


PIB1. 2 


50 


SYSICIBP 


PI31. 2 


51 


CIA5T1BP 


PIB 1, 


2 


52 


HEEIAI11 


PIE1. 2 


53 


MEEEAI12 


PIBl. 2 


54 


MEDFAI13 


PIEl. 


2 


55 


KAIVER 


PIE1. a 


56 


WAIVZEAl 


PI31. 2 


57 


EXAMSIAT 


PIEl. 


2 


58 


EKIEYYR 


PIEl. 2 


61 


TEBMENII 


PIBl. 2 


62 


ENTRPA YG 


PIBl. 


2 


59 


ENTEYMTH 


PIE1. 2 


60 


ENTEYEAY 


PIBl. 








2 


63 


ECMECNTY 


PI £2. a 


65 


PROGENII 


PIB5. 2 


72 


AFEESSIA 


PIBl. 


2 


73 


ECNCSCPT 


PIEl. 2 


74 


ENISICPT 


PIBl. 2 


75 


YOUTHPRG 


PIBl. 


2 


78 


Tlf HATE 


PIEl. 2 


81 


TRENItOS 


PIB5. 2 


86 


IAFMS1 


PIE2. 


2 


88 


EfCCI 


PIE2. 2 


90 


DDCC1 


PIB2. 2 


92 


HYEC 1 


PIB 1. 


2 


93 


PAYGRDE1 


PIEl. a 


94 


SEfiVICEl 


PIBl. 2 


95 


HRTSTAI 1 


PIE 1. 


2 


96 


MEKDNT1 


PIEl. a 


97 


SP NSPE1 


PIB3. 2100 


ISC1 


PIBl. 


210 1 


SEPBT1 YR 


PIEl. 2102 


SEPRI IMT 


PIBl. 2103 


SEPEI1DY 


PIB 1. 


2 


10 4 


EASE1 YE 


PIEl. 2105 


BA SD1BTH 


PIBl. 2106 


BASD1DAY 


PIBl. 


2107 


ETS1YE AR 


PIEl. 2 


108 


ETS1MNTH 


PIBl. 








2109 


EC1E1 YR 


PIEl. 2110 


DOIEIMTH 


PIBl. 








2113 


f EBE1YE 


PIEl. 2 


114 


PEEE1MTH 


PIBl. 2115 


PEBD1DAY 


PIEl. 


211 1 


CHAESRV1 


PIEl. 2 


112 


EIGREUPI 


PIBl. 








2116 


EI1EF1G1 


PIE2. a 


118 


TA E MS2 


PIB2. 








2120 


EECC2 


PIE2. 2122 


DDCC2 


PIB2. 2124 


BYEC2 


PIB 1. 


2 


125 


EAYGREI2 


PIEl. 2126 


SEEVICE2 


PIBl. 2127 


MRTSIATE 


PIE 1. 


2128 


KEENENI2 


PIEl. ai29 


SPNSPD2 


PIB3. 2132 


ISC 2 


PIE 1. 


2133 


SEPET2YE 


PIEl. 2134 


3EERT2HT 


PIBl. 2135 


SEPRT2DY 


PIEl. 


2136 


EASE2YB 


PIEl. 2 


137 


BA SD2MTH 


PIBl. 2133 


BASD2DAY 


PIB 1. 


2139 


ETS2YEAR 


PIEl, 2140 


ETS2MNTH 


PIBl. 








2141 


EC1E2YR 


PIEl. 2142 


D01E2BTH 


PIB 1. 








2145 


f EEE2YE 


PIB1. 2146 


PEED2MTH 


PIBl. 2147 


PEBD2DAY 


PIEl. 


2143 


CEAESR V2 


PIEl. 2 


144 


E1GREUF2 


PIBl. 
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2148 


IIIEFIG2 


PIE2. 2150 


TA EMS3 


PIB 1. 




2151 


TAEi'S4 


PIE1. 2152 


DPCC3 


PIB2. 3154 


DDOC 3 


2156 


EYEC3 


PIE1. 2157 


PAYGEEE3 


PIB1. 3158 


SERVICE3 


2159 


MRTSTAT3 


PIE1, 2160 


NDENENT3 


PIB1. 3161 


SPNSPD3 


2165 


SEPRT3YR 


PIE1. 2166 


SEPRI3MT 


PIB1. 3167 


SEPRT3DY 


2168 


EASI3YR 


PIE1. 2169 


BASD3CTH 


PIBI. 3170 


EASD3DAY 


2171 


ETS3YE AE 


PIE1. 2172 


ETS3MNTH 


PIB 1. 




2173 


ECLE3YE 


PIE1. 2174 


DOIE3MTH 


PIBI. 




2177 


PEEE3YR 


PIE1. 2178 


PEED3MTH 


PIBI. 3179 


PEBD3DAY 


2164 


ISC3 


PI 


El. 








2175 


CHAESE V3 


PIE1. 








2176 


EIGREDP3 


PIE1. 2180 


FIIEFIG3 


PIB2. 




2182 


EIIEMTCH 


PI 


E4. 2186 


DOEYEXEP 


PIBI. 3187 


EOEMTDEP 


2188 


KNTESDEP 


PI 


El. 2189 


SPiLGtiL 


PIBI. 




2190 


ECPGYE 


PI 


El. 2191 


DCPGMNTH 


PIBI . 




2212 


GCT 




2.- 2214 


ARI 


2. 3216 


MECH 


2218 


Cl EE 




2. 2220 


AFQTS 


2. 3222 


PNEC 


2227 


CTZNSHIP 




51. 








2229 


PEIEEPND 




51. 2230 


SECDEPND 


$1. 3231 


ERCL 


2233 


GRCUPIKD 




$1. 2234 


AUTHEATE 


$4. 3240 


EDPGYR 


2244 


SCHICCEE 




51. 2245 


SCHIWVR 


$ 1. 3246 


ASTAE 


2247 


TSSINE 




$1. 2250 


PRESEATE 


$4. 




2254 


RUMPG1 




51. 2255 


PRETAEEV 


53. 3258 


EXAMRATE 


226 2 


KUMPG2 




51. 2263 


EXETAERV 


$3. 3266 


TOTLEAW 


2269 


STENAVY 




2. 2272 


PRCODE 


$2. 3274 


ALTPRCDE 


2276 


FIMMOIT 




5. 2281 


FNMITCUT 


5. 3287 


PRFFACTE 


229 0 


AHIEACTR 




2. 2292 


CHKGEATE 


$1. 




2296 


EATEINX 




51. 2297 


SP PRCIND 


5 1. 3298 


TYPSNLST 


230 1 


MCDEST 




51 . 2302 


NENISIMT 


1. 




2303 


EACS YYMMDE6. 2309 


TA £ 


54. 3313 


CAS 


2317 


ICSCCBE 




51. 2318 


LOSWVE 


51. 3319 


SIPG 


2323 


TIEfiVE 




$1. 2324 


TIB 


54. 




233 6 


AEEE YYMMDE6. 2343 


EDPG YYMMDD6. 3349 


DTIS 


2352 


EECEOEES 




1. 2356 


NCHANGES 


3. 33 84 


AGE 


238 6 


KHRCGCT 




2. 2388 


NHRCAFQT 


2. 3390 


MENTIGRP 


2391 


EDCEETIF 




51. 2392 


MOELESGN 


51- 3394 


EYNDPNDT 


239 6 


Gf P4PECG 




52. 2398 


SSDDTY 


5 1. 3399 


REGRESEV 


2 40 0 


EYPAYGED 




51. 2401 


NOTECMD 


$ 1. 3402 


SSNCHNGE 


2 40 3 


TCTEECMO 




2. 2405 


TOTIDEMC 


1. 3406 


lOTLAKOI 


2407 


TCTEESET 




1. 2408 


TOT MI TCN 


1. 3409 


TOTCVICN 


2412 


IKGTHSEV 




54. 2416 


SCREEN 


2. 3418 


ATTRITCE 


2419 


EECNTC 




51. 2420 


RECENISI 


52. 3422 


HECPECGM 


2423 


FICPEGSC 




52. 2425 


RCPGSCRT 


54. 3435 


ZLSTHIST 


2436 


KEAYSE2 




4. 2440 


NDAYSE3 


4. 3444 


NDAYSE4 


2449 


EMECRATE 




53. 2452 


DM DC NEC 


54. 3456 


DMDCUIC 


2462 


CCNVE ATE 


YYMMEE6. 




3468 GEA 


.XDATE YYM 


2474 


TEAEEATE 


Y YMMDD6. 








2480 


EAENNEC $4. 


2484 


TR AININD 


51. 3485 


STACIION 


lABEI 












CENSC 


ISEG=C£NSUS 


REGICK ( 


10 CODES) 






CENSt 


iSIS=CEKSOS 


DISTEICI 


(5 CCEES) 






EOMEZIP =HCME 


OF 


RECCEE 


ZIP CCES 






EMESTATE = HC ME 


OF 


RECCEE- 


-STATE 







IATEI£Ii: = YEAR OF FINAl QUALIFYING DETERMINATION 
CATEL£TK=MCNTH OF FIbAX QOAIIFYING DETERMINATION 
EIRTHYE =Y£AR OE BIETE 
EIRTHeiH=MCKTH OE EIEIH 
EIRIHrAY=EAY OF EIRTE 

£NTRYAC-I = AG£ OF INDIVIDUAL AT TIME OF ENTRY 
EECCEIID=E£CORD ID — liAM SCORE, EEP, ACTIVE DOTY 
EYEC ^HIGHEST YEAR OE EDUCATION 

SEX =(1) MALE, (2) FEMALE 

RACE =jli WHITE. (^) BLACK, (3) OTHER 
ETHNIC =INriVIDUAL* ’S REPOETrE ETHNIC STATUS 
EAC£ETEN=SIX RACE-ETENIC COMBINATIONS 
i!ETLDENE = M AEITAL S TAT US/DEP END ENTS 

TESTFCEM=TEST FORM/ECEA, ASV AE , Af KST , AFQT ,OSE 

AFCTECKT=AFCT PERCENTILE (OR EQUIVALENT) 



PIE2. 
PIB 1. 
PIE3. 
PIE 1. 
PIE1. 



PIB1. 



PI31. 



$4. 

$ 2 . 

$4. 

$ 1 . 



$4. 




$ 2 . 



.54. 

54. 

3. 

2 . 

$ 1 . 

2 . 

5 1 . 
$ 1 . 
1 . 
1 . 

51. 
$ 1 . 
51. 

4. 

5 6. 
MED6. 

$ 1 . ; 
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iFQIGff £ 

ASVAEGI 

ASVAEKC 

ASVAEAI 

ASVAESK 

ASVAEAE 

ASVAEEE 

ASVAEEK 

ASVAEEI 

ASVAEMC 

ASVAEGE 

ASVAEEI 

ASVAEAI 

SERVACCE 

fBICREf V 

iUL 

EES 

ASVAECM 

ASVAECA 

ASVAECI 

ASVAECC 

INTEISIA 

EEIGEI 

REIGBI 

EYSICIEf 

IIASHEf 

CEDFAIil 

MEDFAII2 

HEDFAJ13 

liAIVEE 

SAIVEEAl 

EXAHEIAI 

lESilEMI 

ENTREA^G 

HOMECJIS 

fROGEKlI 

AFEESSIA 

EONUECf I 

ENISICPI 

YCUIEf EG 

lAPEI AIE 

lEENlECE 

IAFMS1 

EPOC1 

EEOC 1 

HYEC1 

PAYGEEE1 

SEEVICE1 

HRTSIAH 

NBPNEM1 

EPNSPE1 

ISCl 

EEPEI 1YE 
SEPEI 1M 
EEPEI IE Y 
EASE 1 YE 
EASE 1EIH 
EASE II AY 
EIS1YEAB 
EIS1 HEIH 
CHAEEEV 1 
E1GEEQP1 
PEED 1 YE 
PEBB 1£IH 
PEBE 1IAY 
ENTRY YE 
ENTEYKIH 
ENTE YI A Y 
EEPEI 1YE 



■&IQ1 GROUPS 
= AEVAB APTIIDIE 
APTITUIE 
APTITEIE 
APTITUIE 
APTIIDIE 
APTITDIE 
APTIIDIE 
APTITDIE 
APTIIDIE 
APTITDIE 
APTIIDIE 
APTIIDIE 



=ASVAB 
'AEVAB 
-AE VAB 
aSVAB 
'AEVAB 
= AEVAB 
=AEVAB 
'AEVAB 
:AEVAB 
= AEVAB 
=ASVAB 



(5y4C, 4E,4A,3B,3A,2, 1) 

SCORE — SOBSCAIE 



AREA SCORE — SOBSCAIE GI 
AREA SCOEE — SOBSCAIE NO 
AREA SCORE — SOBSCAIE AD 
AREA SCOEE — SOBSCAIE WK 
AREA SCORE — SOBSCAIE AR 
AREA SCOEE — SOBSCAIE SP 
AREA SCORE — SOBSCAIE NK 
AREA SCORE — SOBSCAIE El 
AREA SCORE — SOBSCAIE MC 
AREA SCOEE — SOBSCAIE GS 
AREA SCOEE — SOBSCAIE SI 
AREA SCOEE — SOBSCAIE AI 
= SEEVICE OF ACCESSION (NAVY.2J 
=P£IOR SERVICE (NON-PBIOE SERVICE,!) 
=GEN. HEAITH, UPPER S ICWER EXTEEMITIES 
=BEARING, VISION, PSYCBIAIEIC WEII-BEING 
=ASVAB APTIIDIE AREA SCOEE — SOBSCAIE CM 
APTITDIE AREA SCOEE--SOBSCAIE CA 
APTITDIE AREA SCORE — SOBSCAIE CE 
APTITDIE AREA SCOEE — SOBSCAIE CC 
SIAIOS (l-DIEECI TO ACTIVE DUTY) 
IKCHES (FEACTICNS 



ASVAB 
=ASVAB 
= ASVAB 
= ENIRY 
^EEIGHT 



IN 



DfiOPPED) 



'WEIGHT IN PCDNDS (FRACTIONS ROUNDED) 
:E1CCD PFESSDPE — SYSICIIC 
= EICOD PRESSDEE— DI ASICIIC 
'PRIMARY MEDICAIIY BIS tU AIIFYIN G DEFECT 
'SECONDARY MEIICAIIY DISQOAIIFYING DEFECI 
'lEBIIAEY MEIICAIIY DISQO AIIF YING DEFECT 
'PERMIT CODE ECE AN OIBERHISE INEIIGIEIE 
'WAIVER APPECVAI lEVEI AND EXPIANATION 
'EXAMINATION SIAIOS (1,F0IIY QOAIIFIEB) 
'lEEM OF ENIISIMENT ]NC, OF YEARS) 

'ENTRY PAY GEABE (EOO — Oil) 

'HCME OF RECCE! COUNTY — FIPS 
'PECGRAM ENIISTED FCE — SERVICE ONICOE 
'MIIITARY ENIEANCE PROCESSING STATIONS 
'ECKOS OPTION, CCMBAI C£ NON-COMBAT 
'ENIISIMENI OPTION 

'YCOIH S RESERVE TRAINING PROGRAMS 
'MONTH OF FIIE ON WHICH RECORD SUBMITTED 
'CCCOP. SPECIAI. /RATING CHOICE UPON ENTRY 
'MCNIHS OF ICH. ACTIVE FED. MIIIT. SERV. 
'I.C.D. PRIMARY OCCOPAIICN CODE 
'E.C.D. DOTY CCCUPAIICN CODE 
'HIGHEST YEAR CE EDOCATICN 
'PAY GRADE AS-OF-DA lE-CF-FIIE/SEPAEATION 
'SERVICE CODE (2, NAVY) 

'MAEITAl STAIDS (1, OTHER, 2, MARRIED) 

'NUMBER OF DEPENBENIS (1, NONE) 

'SEPARATION PROGRAM DESIGNATOR 

'INTER-SERVICE SEPARAIICN CODE 

'YEAR OF SEPARATION (2ND DMDC SECTION) 

'MONTH OF SEPARATION (2ND DMDC SECTION) 

'DAY OF SEPARATION (2ND DMDC SECTION) 

'YEAR OF ACTIVE DUTY EASE DATE 

'MCNTH OF ACTIVE DOTY EASE DATE 

'DAY OF ACTIVE COTY EASE DATE 

'ESTIMATED YEAR OF FOIFIllED ACTIVE DOTY 

'ESTIMATED MCMH OF FOIFILLED ACTIVE DOTY 

'CHARACTER OF SERVICE 

'REENLISTMENT ELIGIEILITY 

'YEAR OF PAY ENTRY EASE DATE 

'MONTH OF PAY ENTRY EASE DATE 

'DAY OF PAY ENTRY BASE DATE 

'YEAR OF ENTRY TO ACTIV E/D. E. P. 

'MONTH OF ENTRY TO ACTIVE/D. E. P. 

'DAY OF ENTRY TC AC TIV E/D. E- P. 

'YEAR OF SEPARATION (2ND DMDC SECTION) 
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'EPBI 1«I 

fEPEI 115 

EASE 1 5E 

EASE UIH 

EASEII^l 

ETS1 YEAE 

EISIMKIH 

PIBD 1YE 

PE3D1E1H 

PEBE U£Y 

IILEEIG1 

PIBD2YE 

PEBE2EIH 

PEBE2IiY 

SEPEI2YE 

£EP£I2i!3I 

SEPBI2E Y 

EASE25F 

EASD2EIH 

EASDlLkY 

11S2YIIE 

EISEMEIE 

PEBE2YE 

PEBE2£IH 

PEBD2liY 

IAFMS2 

EP0C2 

ED0C2 

BYEC2 

PAIGEEE2 

SEE7ICE2 

I!EISIAa2 

NDPNEK'12 

SPNSfE2 

ISC2 

CHAESE72 

E1GEEDP2 

IILEEIG2 

PEBD35E 

PEBE2E1K 

PEBE3I ZY 

SEPBI3YE 

SEPElSii’I 

SEPE13E Y 

EASE3YP 

EASDSEIH 

EASE3IZ Y 

ETS3YEAE 

ETS3£1K'IH 

PEBE3 YE 

PEBE3KIH 

PEBD3EA Y 

5APMS3 

IAFMS4 

EP0C3 

ED0C3 

BYEC3 

PAYGEIE3 

SERVICE3 

EETSIZI3 

KDPNEK73 

SPNSPE3 

ISC3 

CHAESBV3 

ELGEEDP3 

IILEE1G3 

FILEillCE 

EOEYEIEP 

EOEMIEEP 



= MCETH OF SEPZEAHON (2ND DMDC SECIION) 

= EZY OF SEPABZUCN (2NE EMDC SECTION) 

= YEAE OF ACTIVE DUTY EZSE DATE 

=MCNTH OF ACIIVE DUTY EASE DATE 

=EAY OF ACTIVE EUTY EASE EATE 

=ESIIHATED YEAE OF FOlFIllED ACTIVE DUTY 

=E£TIi1ATED MCMH OF FUIFILIED ACIIVE EUTY 

=YEAE OF PAY ENTEY EASE DATE 

=MCNTH OF PAY ENTEY EASE DATE 

=EAY OF PAY ENTEY BASE DATE 

=FIIE FLAG NC. 1 

=YEAE Of PAY ENTRY EASE DATE 

=MCNTH OF PAY ENTRY EASE DATE 

=£AY OF PAY ENTRY BASE DATE 

=YEAE OF SEPAEATION (3RD DUDC SECTION) 

=NCNTH OF SEPAEATION (3EE DilDC SECTION) 

=EAY OF SEPAEATION (3^E DMDC SECTICN) 

= YEAE OF ACTIVE DUTY EASE DATE 

=MCNTH Of ACTIVE DUTY EASE DATE 

=EAY OF ACTIVE DUTY EASE DATE 

=ESTIHATED YEAR Of FULFILLED ACTIVE DUTY 

= ESTIMATED MCMH OF FUIFILLED ACTIVE EUTY 

=YEAE OF PAY ENTRY EASE EATE 

=MCNTH OF PAY ENTRY EASE EATE 

=EAY OF PAY ENTEY BASE EATE 

= MCNTHS CF ICTL. ACIIVE FED. MILII. SEEV. 

= D-C.D. PRIMARY OCCOPATICN CODE 

=D.C.D. EUTY CCCUPAIICN CODE 

=HIGHESI YEAE OF EDCCAIION 

=PAY GRADE AS-CF-EA I E-CF-FILE/SEPARAT ION 

= SEBVICE COEE 12, NAVY) 

= MAEITAL STATES (I.CTEEB, 2.MAERIED) 

= NUMBEE CF DEPENDENTS (1, NONE) 

=SEPARATIOH PFCGEAM EESIGNATOR 

=INTEE-SEEVICE SEPAEATION CODE 

= CEARACTER OF SERVICE 

=EEENLISIMENI ELIGIBILITY 

=FIIE FLAG NC. 2 

=YEAfi OF PAY ENTEY EASE EATE 

=MCKTH OF PAY ENTRY EASE DATE 

=EAY OF PAY ENTEY BASE EATE 

= YEAE OF SEPARATION (4IH DMDC SECTION) 

= MCNTH OF SEPARATION (4TH DMDC SECTION) 

=EAY OF SEPARATION (4TH DMDC SECTICN) 

=YEA£ OF ACTIVE DUTY EASE DATE 

=MCNTH OF ACTIVE DUTY EASE DATE 

=EAY OF ACTIVE DUTY EASE DATE 

=ESTIMATED YEAE OF FUIFILLED ACIIVE DUTY 

=ESTIMATED MCNTH OF FUIFILLED ACIIVE EUTY 

=YEAE OF PAY ENTRY EASE EATE 

=MCNTH OF PAY ENTRY EASE DATE 

=EAY OF PAY ENTRY BASE EATE 

= MCNIHS CF TCTL. ACIIVE FED. MIIII. SEEV. 

=MCNTHS CF TCTL. ACIIVE FED. MILIT. SEEV. 

=E.C.D. PRIMARY OCCUPATION CODE 

=E-C.D. EUTY CCCUPAIICN CODE 

=BIGHEST YEAE OF EDUCATION 

= PAY GRADE AS-CF-DA T E-CF-fILE/SEPARATION 

= SEEVICE CODE (2, NAVY) 

= MAEITAL STATES (1.CTBER, 2, MARRIED) 

= NUMBEE OF DEPENDENTS (1, NONE) 
=SEPARATION PROGRAM EESIGNATOR 
=INTEB-SEEVICE SEPAEAIICN CODE 
=CBAEACIER OF SERVICE 
=REENLISTMENT ELIGIEIIITY 
=FIIE FLAG NC. 3 

=4-EYTE BINARY FILE MATCH INDICATORS 
=DOE YEAE INTO D.E. P. 

=DCE MONTH INTC D.E.P. 
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BNTHSrif^ 

£PF1GI!1 

ICPGMNTH 

XCPGIB 

GCT 

ABI 

MICH 

CLEB 

PNEC 

CIZNIEJP 

EECl 

GBOOPIND^ 

AUTHEill^ 

IDPGII 

ICHICCIE^ 

SCHIKVB ^ 

PBESEAII^ 

PBRIjIIIV 

IXAMEJiTE^ 

EXEIAIEV^ 

30T1EAW ^ 

SIDBAVY : 

PECOEE 

AITPBCDI^ 

IINIKCII^ 

ENMIICDI 

PEFIACIE^ 

AWIFACIE 

CHNGEAIE^ 

KENISIMI 

lAOS 

IAS 

CAS 

SIPG 

lOSCCCI ^ 

lOSHVE 

“HErSE 

3IR 

AEBD 

EEPG 

ETIS 

KCHANGES^ 

AGE 

NH3CGC1 ^ 

NHRCAICI^ 

EENTlGEf^ 

EECEE!III^ 

EOBIESGK^ 

HYNEPKEI^ 

GRP4EECG^ 

SSDD15 

EEGBISR 

HYPAYC-EE^ 

NOTECED 

SSNCEEGI^ 

lOTPECEC^ 

lOTLEIMC 

lOTlAKCl 

EOTDISEl^ 

lOTMlJCN 

lOTCT/lCN 

INGIHSEV 

SCREES 

AITRIICE^ 

EECNIC 

EECESIS3 

EECPECGM 

EECPBGSC 

ECPGSCEa 



= MCNTHS IN D.E.P. 

= SPANISH FLAG MASTEE/ICSS 
= MCSIH Of DCPG 
= YEAE OF DCPG 
'EASIC BiTTEEY GCT 
=EA£IC BflITEEi ARI 
=EASIC BATTEEi MECH 
= EA£IC BATTEEI CIEB 
= NATiY ENilSTEE JOB CODE 
=CIIIZEN£HIP CODE 
= ERANCH/CLAS£ 

= GECUP INDICATOR 
'ADTHORIZED BATE (ABBE.) 

=EFfECTI-VE DATE OF PAY GRADE 
'SCHOOl CODE 
'SCHOOL KAIVEE 
'PRESENT RATI CODE 
'EEESENI RATE (AEBR.) 

'EXAMINATION BATE CCDE 
'EXAMINATION BATE (ABBE.) 

'TOTAL RAW SCCEE 
'STANDAREIZEI NAVY SCCEE 
'PROCESS CODE 
'ALTERNATE PECCESS CCDE 
'CANDIDATE’ * S FINAL MHITIPLE 
'FINAL MCLTIPIE CUT 
'PERFORMANCE fACTOR 
:AliI FACTOR 

'CHANGE OF RATE INDICATOR 
'NUMBER CF EMISTMENTS 

'EXPIRATION Cf ACTIVE CBLIGATED SERVICE 
'TCTAL ACTIVE SERVICE 
'OTHER ACTIVE SERVICE 
'SEFVICE IN PAY GRADE 
'LENGTH CF SEFVICE 
'LENGTH CF SERVICE WAIVER 
'TIME IN RATE WAIVER 
'TIME IN RATI 
'ACTIVE DUTY EASE DATE 
'EFFECTIVE BATE OF PAY GRADE 
'DRILL TIME IN SERVICE 
'NUMBER CF CHANGES/ ENTRIES IN NHEC 
'CANDIDATE' ’ £ CURRENT AGE 
'NHEC FIIE''£ GENEL. CIASSIFICAIION 
'NHEC FILE''£ ARMED FCBCES QUALIFY, 
'MENTAL GROUP CODE 
'EDUCATION CERTIFICATE 
'MIIITAEY OEIIGATION DESIGNATOR 
'HIGHEST NUMEER CF PRIMARY DEPENDENTS 
'GECUP IV (100K) PROGRAM CODE 
'SEA-SHORE DITY INDICATOR 
'EEGULAE RESERVE INDICATCB 
'HIGHEST PAY GRADE 

'NCT RECCMMENIED FOE EI-ENLIS TMENT 
'SOCIAL SECUEITY/NAME CHANGE 
'TCTAL PEOMOTICNS 
'TCTAL DEMOTICNS 
'TCTAL UA/AWCl 
'TOTAL DESERTIONS 
'TCTAL MILITARY CONFINEMENTS 
'TOTAL CIVILIAN CONFINEMENTS 
'LENGTH CF SERVICE 
'SCREEN SCORE 
'ATTRITION INDICATOE 
'RECRUIT NAVAI TRAINING COMMAND 
'RECRUIT TYPE ENLISTMENT 
'BECRUIT PROGRAM AT ENLISTMENT 
'EECEUIT PROC-RAM/SCBCCI 
'RECRUIT PROGEAM/SCHOCI RATE 



FILE 

TEST 

TEST 
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Z1SIKI£1=EKIISIEE HIS3CEY SIAIDS 
KCAY5Z2 =CCKPUIEE NOKZEE OF EAiS 10 E-2 EAIING 
KEAYSE3 =CC£PUIEE NUCIIE OF EAYS 10 E-3 BATING 
NEAYSia =CCtPUTEE NUJ5EEE OF EAYS TO E-4 EAIING 
XOLElYi =YEAR OF LAIFSI EE- E NIIS I MENT 
EOXE 1KIH=£!CNTH OF LATEST EE-ENIISIMEN T 
E0LE2YE =YEAE OF LATEST BE- ENLIST ilENI 
E0LE2«IH=MCNTH OF LATEST BE-ENIISIHENI 
E0LE3YE =YEAE OF LATEST EE- ENLISTMENT 
E0LE3l'TH = MCNIH OF LATEST RE-ENIISIMEN T 
EMDCEATE=FINAL EATING AS LISIEI BY D.M.E.C. 

EMDCNEC =FINAL N.E.C. AS LISIEE BY D.M.E.C. 

EMDCOIC =FINAL O.I.C. AS LISIEE BY D.M.E.C. 
CONVEAT-E=CCNVENING DATE FOE NITRAS COURSE 
GEADrAIE=GEADDAIION EATE FOE NIIEAS COURSE 
IEANEAIE=TBANSACIION EATE FOE NIIEAS RECORD 
EAENNEC =DIE CANDIDATE EARN AN NEC? 

TRAININE=TEAINING INDICATOR 

STACI3CN=STDDENI ACTICN CODES (PASS, P, ETC.) ; 

* THIS SCREEN SELECTS ONLY THOSE CASES WHICH HAD ANY 
AFFIIIAIICN WITH TEE 'AD' RATING. THAI IS, IHD SE CASES 
WHICH AEZ LISTED IK THE DMEC FILE AS PRESENTLY AD'S 
(PEEIAEEVJ OR AS FINALLY AD'S (DMDCRAIE) , OR AS SIGNING 
UP FCE AI^S (ECPGSCFT), OR AS HAVING TAKEN THE AE 
RATING EXAMINATION (EXAMRAIE).; 

IF EMDCRATE=' AD ' OR PR ETAFE 7=' AD ' OR 
ECPGSCFT='6200' CE EXA MEAIE= • 620 0 ' ; 

* THIS NEXT SECTION CtTPUTS BASIC FREQUENCIES TO CHECK 
THAT TEE RATING SPICIFIC EAIA HAS BEEN WRITTEN CNTC 
TEE FILE IN MASS SICBAGE. ; 

EEOC FREC EAIA=FILEOtT. ADDATA: 

lAEIES DKDCRATE PRRIABRV ECEGSCRI EXAMRAIE: 

TITLE CHECKOUT FREQUENCIES FROM THE FILE ADDATA.; 
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TAEIE XII 

Pcograa to Screen the AD Data 



//ADSCREEN JOB (2807 -C110), 
//♦aA3K CEG=NPGVal. 2fcC7P 
// EXEC SA£ 

//SAS.KCEK £D SP ACE= (CYL- ( 1 
//FIIEIN £D DISP=SHE.D£N=M 
//FIIECCI ££ UNIT=33ECV- aSV 
// £ISP=(NEW, CAI1G,£E1£IE) 
// £C£= (B1KSIZE=640C) 

//SYSIN ID * 

CPTICfS 1S=60 NOCENIEE; 



•D CSLUND, SMC 176 3' ,C 1 A££=E 



2t^) i 

SS.S2807.ADDAIA 

GP=EUE4A, 

,DSli=MSS.S2807. ADSUBSE I, 



* THIS PEOGEAM REDUCES THE KUMEEE OF CASES IN THE DATA 
SEl EY SCREENING ON CERTAIN VARIABLES. THE INTENT 
OF TEE SCREEN IS SUEMARIZED ABOVE THE APPROPRIATE 
SAS STATEMENTS; 



lATA EIIECUT.ADSDBSET ; 
SET FIIEIN. ADIATA ; 



♦ THE NUMBER OF VARIAELES IN THE DATA IS REDUCED TO 
REDUCE TEE WORK SPACE REQUIREMENTS.; 



KEEP 

AFCTGRPS 

ASVAEAR 

ASVABMK 

ATTRITCD 

EASI1 YR 

IMBCNEC 

DPCC3 

ENTRYAGE 

ETHNIC 

EI1EF1G1 

EYEC1 

ISC3 

MRTSTAT1 

NDPNDNT1 

PAYGRDE3 

PRFIACTR 

RCPGSCRT 

SCREEN 

SEPFT3DY 

SERVICES 

SIDNAVY 

TESTFORM 

TCTIRAK 

SAIVERAL; 



AFCTPCNl 

ASVABEI 

ASVABNC 

AUTHRATE 

CHARSRV1 

DMDCRATE 

EDCERTIE 

ENTRYDAY 

ETS1 MNTE 

FI1EFLG3 

HYEC3 

LNGTHSRV 

MRTSTAI3 

NDPNDNT3 

PEEDI DAY 

PRIORSRV 

RECENLSI 

SEPRT1IY 

SEPET3CT 

SEX 

TAFMS1 
TOTC VLCN 
TOTMLTCN 



AGE 

ASVABGI 

ASVAESI 

ARIF ACTR 

CHARSRV3 

DOLE IMTH 

ELGREUP 1 

ENTR YMTH 

ETS1 YEAR 

FINLMULT 

HYNDPNDT 

MENTLGRP 

NBAYSE2 

NHRC AFCI 

PEBD IMIH 

PRRT AERV 

EECORDID 

SEPRT1MT 

SEPRT3YR 

SPNSPD1 

TAFMS3 

TOTDESRT 

TCTPRCMC 



ASVABAD 

ASVABGS 

ASVABSP 

BASD1DAY 

DDOC1 

DOLE1 YR 

ELGBEUP3 

ENTRYSTA 

EXAMRATE 

FNMLTCUT 

HYPAYGRD 

MOBLDSGN 

NDAYSE3 

NOTRCMD 

PEBD1 YR 

RACE 

RECPRGSC 

SEPRT 1YR 

SERVACCS 

SPNSPD3 

TAFMS4 

TOTLAWCL 

TRENLMOS 



ASVAEAI 

ASVAEMC 

ASVABWK 

BASD IMTH 

DDOC3 

DPOC1 

ENTRPAYG 

ENTRYYR 

EXRTABRV 

HYEC 

ISC1 

MRTLDPND 

NDAYSE4 

PAYGRDE1 

PRESRATE 

RACEETHN 

REGRESRV 

SEFRT3DY 

SERVICEI 

SSNCHNGE 

TERMENLT 

TCTLDEMO 

WAIVER 



♦ THIS SCREEN 
DMDC FATING 



SEIECTS ONLY THOSE CASES WHOSE 
IS AD. ; 



FINAL 



IF DMDCRATE EQ *AD' ; 



* THE FCILCRING LINE SELECTS ONLY THOSE CASES WITH NC 
PRICP SERVICE. TO EURTHER REMOVE POTENTIAL PRIOR 
SERVICE CASES THOSE WHO HAVE CHANGED THEIR SOCIAL 
SECURITY NUMBER ARE ALSO REMOVED FROM THE SAMPLE.; 

IF ERICRSRV = 1; IF SSNCENGE EQ 0; 
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* THE ECllCKING STAIEHENIS SELECT ONLY THOSE CASES WEC 
WESE lESTEL ON ASVAE POEMS 5. 6 0£ 7. ALSO THOSE 
CASES KITE PECULIAPIY HIGH ASVAB SCORES ARE 
ELItINATEE FROM THE EATA SET.; 



IP 

IP 

IP 

IF 

IP 



((TESTFOEM GE 35) ANC (lESTFOBM 



5SVABGK = 1 5 
ASVABAfi<=20 
1SVABGS<=20 
ASVABHK<=30 



IF ASVABNC<=50; IF 
IP ASVAESE<=20; IF 
IP ASVAESK = 20; IF 
IP ASVAEEK = 30; IF 



LE 37)); 
ASVABAD<= 30, 
ASVABMK<=20i 
ASVABAI<=20 
ASVABHC< = 20 i 



THIS SCREEN ONIY KEEPS THCSE WHO SIGNED UP FOE 
NAVY CP NAVAL EESERVE. ; 

IP ((SEEVACCS EC 2) OH (SEEVACCS EQ 8)); 

ONIY TEOSE CASES WEC WERE KNOWN TO HAVE SIGNED 
UP POR AT LEAST FOCR YEARS ACTIVE DUTY ARE KEPI.; 

IP EECENLST EQ 'll'; 

THE CASES ARE SCREENED TO INCLUDE ONLY THOSE KITH 
•GCCI' CR 'BAD' INTEESERVICE SEPARATION CODES, 
•GREY* CASES ARE EIIMINATED. ; 



IP ISC 1=0 OR ISC1=1 

OR (ISC1 GE BO 
IP ISC3=0 OH ISC3=1 

OR (ISC3 GE 60 



£ND ISC1 LT 90) ; 
AND ISC3 LT 90) ; 



* THIS NEXT SCREEN KEEPS THCSE CASES FOR WHICH CLEAR 
'EIIGIEIE TO EEENLJST' DATA IS AVAILABLE.; 

IP iLGBEUP1=0 OB ELGRE0P1=1 OR ELGEEUPl=4 OR 
(EIGEEUP1=240 ANE (ELGREDP3=0 OR ELGEEUP3=1) ) ; 

* THESE SCREENS ELIMI^ATE CASES WITH IMPOSSIBLE DATA. 



IP AFCIGRPS NE 0: 
IP LCSKNTHS LE 7*; 
IP RACE NE C; 

IP TAFKS1 LE 72: 

IP ENTRYAGE NE /7; 
IP PACEETEN NE 0; 



IP ENTRPAYG NE 0: 

IP LNGTHSRV NE 0603; 
IP ETHNIC NE 0: 

IP LNIHSRV NE 0: 

IP APCIPCNT NE 0; 



/* 

// 
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lAElE XIII 

Prcgrafl to Create Mew Variatiles 



•D CSIUND, SflC 1763* ,:LAS£=B 



//FIIIIN ID DISP=SH£,‘DsS=M£5-^^807.ADS0BSEa: 



//ADNISVaE JOB (2807.0110) 

//♦MAIN CBG=NPGVa1. 28C7P 
// EXEC SAB 

//SAB.SCEK ED SP ACE= (CH/ (1 2 , 

-- DISP=SH£. Df 

//FIIECOI DC UNII=333CV,HSVGE=EUB4A, 

// CIBP= (NEH,CAI1G,IEIEIE) ,DBN=xMSS . S2807. ADA1L4, 

// DCE= (E1KSIZE=640C) 

//SYBIN ID * 

CPTICFB IB = 80 NOCENHF ; 

=•■ THE POfPCBE OF THIB PEOGRAM IS TO GENEEATE NEW VAS- 
lAEIES FOE USE IN TEE ANAIYSIS, EITHER BY RECODING 
ORIGINAl 1JARIAELES, OE BY CEEATING NEW VARIABLES; 

DATA FI1ECDT.ADAIL4; 

SET fllEIN. ADBUBSET; 

* THE FCILCKING IINEB_CEEATE EIEFEEENI • ENIEY GROUPS’ 

1 

2 ’ 

3 

5 



ENTRY 


EXAM 


DMDC 


YES 


YES 


YES 


YES 


YES 


NO 


YES 


NO 


YES 


YES 


NO 


NO 


NO 


YES 


YES 


NO 


YES 


NO 


NO 


NO 


YES ; 



IF 

IF 

IE 

IE 

IF 

IF 

IF 



(ECPGBCST^ 

DKDCEATE^ 

(ECPGBCET^ 

DEDCEATE 

(ECPGBCRT^ 

D1!DCEATE= 

(BCPGBCRT= 

EEDCFATE 

(RCPGBCRT 

DH£CEATE= 

(ECPGBCET 

CEDCFATE 

(ECPGBCET 

CMDCEATE= 



= •6200’ 
= 'AD* ) 

= ’620C» 
NE • AI 
:'620C 
= ’AD* ) 
:'620C* 
NE * AI 
NE ’ 62 
'•AD 
NE • 

NE 'AI 
NE '62 
= ’AD» ) 



• h 



AND 
THEN 
AND 
') TH 
AND 
THEN 
AND 
•) TH 
00’ A 
THEN 
00' A 
•) TH 
00’ A 
THEN 



AND 



AND 



EXAHEAIE=* 6200' 
ENTEYGRP=1 : 

EXA£!EAIE=' 6200' 

EN ENT£YGRP=2: 
EXAHEATE NE '6200' 
ENTEYGEP=3 : 

EXAERATE NS '6200' 

EN ENISYGRP=4; 

NC EXAMRATE='6200' 
ENTEYGRP=5 ; 

ND EXAHRATE=' 6200' 

EN ENTRYGRP=6; 

NE EXAHRATE NE '6200' 
ENIEYGRP=7 ; 



AND 

AND 

AND 

AND 



AND 



* IN THIB BECTICN, TEE DNDC VARIABLE 'HYEC IS CO N- 
VEETED TO A COKTINOCUS VAEIAEIZ REPEESENTING NUMBEE 
OF YEAES CF EDCCATICN; 



IF 

IF 

IF 

IF 

IF 

IF 

IF 



EYEC=1 
EYEC=3 
HYEC=5 
£YEC=7 
EYEC=9 
HYEC= 11 



THEN 

THEN 

THEN 

THEN 

THEN 

THEN 



CHYEC=3.5 

CHYEC=9: 

CHYEC=11; 

CHYEC=13; 

CHYEC=15: 

C£YEC=18 



EYEC=13 THEN CE5EC=11 



IF 

IF 

IF 

IF 

IE 

IF 

5; 



BYEC=2 THEN CHYEC=8: 
HYEC=4 THEN CHYEC=lO; 
HYEC=6 THEN CHYEC = 12 
HYEC=8 THEN CHY£C=14 
HYEC=10 THEN CHYEC=1 
HYEC=12 THEN CHYEC=20: 



A NIfi CATEGORICAL VARIABLE 'HSDG' IS NOW CREATED. 

A HIGH SCHOOL GEADDATE IS CCEED A '1' AND A NON HIGH 
SCHCCl GRADUATE OR A G-E. D. IS CODED '0'.; 



IF 

IF 



11 



EYEC LE 5) OE (HYEC E 
EYEC GE 6) ANI (HYEC 






THEN 

THEN 



HSDG=1; 
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• • • 



=» THIS SECIICN CEEATES NEW VARIABLES REPRESExNIINS 
SIAKIAfLIIED ASVAB SCCRES.; 



IF 

If 

If 

IF 

If 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

If 

IF 

If 

IF 

If 

IF 

If 

IF 

IF 

IF 

If 

IF 

IF 

IF 

IF 

If 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

If 

IF 

IF 

If 

IF 

If 

IF 

If 

If 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 



ASVABGI^ 
ASIAEQI- 
ASVABCI^ 
ASVAEGI 
ASVABC-I^ 
ASyAECl 
AS VAEGI 
ASVABGI 
ASVAEGI 
ASVABGI 
ASVAEGI 
ASVABGI^ 
ASVABGI^ 
ASVAEGI 
ASVAEGI^ 
ASVAEGI^ 
ASVABAR 
ASVAEAR 
ASVABAR 
ASVABAR^ 
ASVABAR^ 
ASVABAR^ 
ASVABAR 
ASVABAR^ 
ASVABAR 
ASVABAR 
ASVABAR 
ASVAEAR 
ASVAEAR 
ASVAEAR^ 
ASVAEAR 
ASIAEAB- 
ASVABAR 
ASVAEAR^ 
ASVABAR 
ASVABAR 
ASVAEAR 
ASVABSR 
ASVAESF 
ASVAESE 
ASVABSR 
ASVAESf 
AS VABSE 
ASVABSR 
AS VAESR 
ASVABSR 
ASVAESf 
ASVAESf 
ASVAESf 
ASVABSR 
ASVABSR 
ASVABSR 
AS VAESR 
ASVAESf 
ASVABSR 
ASVABSR 
ASVAESf 
ASVAESf 
ASVABMK 
AS VAEFK 
ASVABGS 
ASVABAI 
ASVAEAI^ 
ASVABAI 
ASVABAI 
AS VAEAI 



:0 THEN SAS 
THEN SAS 
= 2 THEN SAS 
= 3 THEN SAS 
= 4 THEN SAS 
=5 THEN SAS 
•6 THEN SAS 
=7 THEN SAS 
=8 THEN SAS 
:S THEN SAS 
= 10 THEN SA 
= 11 THEN SA 
= 12 THEN SA 
= 13 THEN SA 
= 14 THEN SA 
= 15 THEN SA 
= 0 THEN SAS 
=1 THEN SAS 
=2 THEN SAS 
= 5 THEN SAS 
=4 THEN SAS 
=£ THEN SAS 
=6 THEN SAS 
=7 THEN SAS 
= 8 THEN SAS 
=S THEN SAS 
= 10 THEN SA 
= 11 THEN SA 
= 12 THEN SA 
= 13 THEN SA 
= 14 THEN SA 
= 15 THEN SA 
= 16 THEN SA 
= 17 THEN SA 
= 18 THEN SA 
= 19 THEN SA 
= 20 THEN SA 
= 0 THEN SAS 
=1 THEN SAS 
=2 THEN SAS 
=3 THEN SAS 
=4 THEN SAS 
=5 THEN SAS 
=6 THEN SAS 
=7 THEN SAS 
=8 THEN SAS 
= S THEN SAS 
= 10 THEN SA 
= 11 THEN SA 
= 12 THEN SA 
= 13 THEN SA 
= 14 THEN SA 
= 15 THEN SA 
= 16 THEN SA 
= 17 THEN SA 
= 18 THEN SA 
= 19 THEN SA 
= 20 THEN SA 
= 0 THEN SAS 
=1 THEN SAS 
= 20 THEN SA 
=0 THEN SAS 
= 1 THEN SAS 
= 2 THEN SAS 
= 3 THEN SAS 
=4 THEN SAS 



VABGI= 
VAEGI= 
VABGI= 
VABGI= 
VABGI= 
VABGI= 
VABGI= 
VABGI= 
VABGI= 
VABGI= 
SVABGI 
SVABGI 
SVA3GI 
SVABGI 
SVABGI 
SVABGI 
VABAR= 
VAEAR= 
VABAR= 
VABAfi= 
VABAR= 
VABAR= 
VA£AR= 
VABAR= 
VABAR= 
VABAR= 
SVABAR 
SVABAfi 
SVABAR 
.SVABAR 
SVABAR 
SVABAR 
SVABAR 
SVABAfi 
SVABAR 
SVABAfi 
SVABAfi 
VABSR= 
VABSP= 
VABSR= 
VABSR= 
VABSP= 
VABSR= 
VAB SR= 
VABSR= 
VABSR= 
VABSR= 
SVABSP 
SVABSR 
SVABSP 
SVABSP 
SVABSR 
SVABSR 
SVABSR 
SVABSR 
SVABSP 
SVABSR 
SVABSP 
VABMK= 
VAEMK= 
SVABGS 
VABAI= 
VABAI= 
VABAI= 
VABAI= 
VABAI= 



20 

24 

zl 

30 

33 
36 

42 

45 

= 5^ 
= 54 
=57 
=60 
=63 
= 66 

H' 

zb 

27 

29 
32 

34 
36 
38 
40 

= 44 
= 46 
= 48 
= 51 
= 53 
= 55 
=57 
= 59 
= 61 
= 63 
= 65 
20 
21 

^6 

28 

38| 
40: 
= 42 
= 45 
= 47 
= 50 
= £2 
=54 
= 57 
= 59 
= 61 
= 64 
=66 
26; 
28: 
=7(5 
26; 
28 

30 
| 2 ; 
34; 



If ASVAEMK=2 THEN SASVAEHK=30; 
IF ASVABMK=3 THEN SASVAEHK=32 ; 
IF ASVAEMK = 4 IHEN S ASVABMR=35 ; 
IF ASVAEMK=5 IHEN SASVAEHK=37; 
IF ASVAEHK=6 THEN S AS V AEMR=3 9 ; 
If ASVAEHK=7 IHEN SASVAEMR=41; 
IF ASVABMK=8 IHEN SASVABHP=43; 
IF ASVAEMK=9 THEN SASVAEMR=45: 
If ASVABMK = 10 THEN SAS7ABaK=47; 
IF ASVABMK = 11 THEN SASVABi;K=4S 
If ASVABMK=12 THEN SASVJEaK=51 
IF ASVABflK=13 THEN SASVAEEK=53 
IF ASVAEMK = 14 THEN SASVAEi!K = 55 
IF ASVABMK=15 IHEN SASVAB2fK=57 
IF ASVABMK=16 IHEN SASVAB£K=5S 
IF ASVABMK=17 IHEN S£SV5£i!K = 61 
IF ASVABHK = 18 THEN SASV£BEK=63 
IF ASVABMK=19 THEN SASV^3KK=65 
IF ASVABMK=20 THEN SASV A£F.K=67 
IF ASVAEilC=0 IHEN SAS VAE;4C=25 ; 
If ASVAEilC = 1 IHEN SASVA£HC=27; 
IF ASVA£HC=2 IHEN S ASVAEHC=30 ; 
IF ASVAEMC=3 IHEN SASVAEMC=32; 
IF ASVAEMC=4 IHEN SASVA£HG=34; 
IF ASVAEHC=5 IHEN SASVAEHC=37; 
If ASVA£MC=6 IHEN SASVA£HC=39; 
IF ASVAEMC=7 IHEN SASVAEaC=41; 
IF ASVA£MC=8 IHEN SASVAEMC=43; 
IF ASVAEMC=9 THEN S AS V AEHG=4 6 : 
IF ASVABHC = 10 THEN SASVABaC = 48 
IF ASVABMC=11 THEN SASVflENC=50 
IF ASVABMC=12 THEN £ASVAEMC=53 
IF ASVABMC = 13 THEN SASVABi;C = 55 
If ASVABMC = 14 THEN SASVA3J;C=57 
If ASVABilC = 15 THEN SASVAEEC = 6C 
IF ASVABMC=18 THEN SASVA3J3C = 62 
IF ASVABMC=17 IHEN S£SVABKC=64 
IF ASVABMC = ia THEN SASVAEilC=66 
IF ASVABNC=19 THEN SASVAEEC=6S 
If ASVABMC = 20 THEN SASVAEaC=71 
If ASVA£GS=0 IHEN SASVAEGS=24; 
If ASVA£GS=1 IHEN SASVAEGS=26; 
IF ASVAEGS=2 IHEN SASVAEGS=29; 
IF ASVA£GS=3 IHEN SASVAEGS=31; 
IF ASVA£GS=4 IHEN SASVAEGS=33; 
IF ASVAEGS=5 IHEN SASVAEGS=36; 
If ASVA£GS=6 IHEN SASVA£GS=38; 
IF ASVA£GS=7 IHEN SASVA£GS=40; 
If ASVA£GS=8 IHEN SASVAEGS=42; 
IF ASVAEGS=9 IHEN SASVA£GS=45: 
IF ASVABGS=10 THEN SASVABGS=47 
IF ASVABGS=11 THEN SASVAEGS=4S 
IF ASVABGS=12 THEN SASVABGS=52 
IF ASVABGS=13 THEN SASVABGS=54 
If ASVABGS=14 THEN SASVAEGS=56 
IF ASVABGS=15 THEN SASVAEGS=5£ 
IF ASVABGS=16 THEN £ASVAEGS=61 
IF ASVABGS=17 then SASVA3GS=63 
IF ASVABGS=18 THEN SASVABGS=65 
IF ASVABGS=19 THEN SASVABGS=6S 
IF ASVABWK=30 THEN SASVASKX=64 
If ASVAEAII = 0 IHEN SASVA£AI=20; 
If ASVAEAD=1 IHEN SASVAEAL=20; 
IF ASVAEAD=2 IHEN £ASVAEA£=20; 
IF ASVAEAD=3 IHEN £A£VAEAC=21; 
IF ASVA£AD=4 IHEN SASVAEAE=24; 
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If 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

If 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

If 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

If 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 



ASVAEAI= 
A£VAFAI= 
ASVAF AI= 
ASVA£AI= 
ASVAFAI= 
ASVAEAI= 
ASVAEAI= 
A£VAEAI= 
ASVAEAI= 
ASVAEAI= 
AST?AEAI = 
ASVAEAI= 
ASVAEAI= 
ASVAEAI= 
ASVAEAI= 
ASVAEAI= 
AS1;AE£I= 
A£VA£EI= 
A£VAEEI= 
A£VAEEI= 
ASVAE£I= 
A£VAE£I= 
A£VAE£I= 
ASVAE£I= 
A£VAE£I= 
A£VAE£I= 
A£VAB£I= 
A£DAE£I= 
ASVAE£I= 
ASVAB£I= 
A£T/AE£I = 
A£VAB£I= 
A£T/AE£I= 
A£T/AB£I = 
A£VAE£I= 
A£)fAE£I = 
A£VAB£I= 
AS'yA£BK = 
ASVAEKK= 
ASVA£KK= 
A£TIAE5K = 
ASVAEKK= 
AST/AEWK = 
AS^AE«K= 
ASVA£KK= 
AS\AERK= 
AST?AEHK = 
A£ VAERK= 
AS'yAERK= 
ASVABKK= 
ASVABRK= 
A£1;ABRK = 
A£T/AEKK = 
AST/ABRK = 
A£ VAERK= 
AS'9AEHK= 
ASVAE»K= 
A£VAERK= 
A£'SAERK= 
A£'»ABliK = 
ASVABRK= 
A£VABKK= 
A£T/AERK = 
AS'VAERK = 
A£VAEKK= 
A£T/AERK = 
AS1>ABHK = 
A£T/AB»0 = 
A£T/AEliC = 



6 

7 

£ 

9 

10 
11 
12 

13 

14 

15 

16 

17 

18 

19 

20 
0 

1 

2 

3 

4 

c 

6 

7 

8 

5 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
0 

1 



4 

c 

w 

6 

7 

8 

5 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 

26 

27 

28 
29 
10 
11 



THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 



SA£ 
SA£ 
SA£ 
SA£ 
S A£ 
SA 
Si 
Si 
SA 
SA 
SA 
SA 
SA 
SA 
SA 
SA 
SA£ 
SA£ 
SA£ 
S AS 
SA£ 
SA£ 
SAS 
SA£ 
SAS 
SAS 
SA 
SA 
SA 
SA 
SA 
SA 
SA 
SA 
SA 
SA 
SA 
SAS 
SAS 
SAS 
SAS 
SAS 
SAS 
SAS 
SAS 
SAS 
SAS 
SA 
SA 
S A 
SA 
SA 
SA 
SA 
Si 
SA 
Si 
SA 
SA 
S A 
SA 
S A 
SA 
SA 
SA 
SA 
Si 
SA 
SA 



\ABAI= 
VA£AI= 
VABAI= 
\ABAI= 
VABAI= 
SVABAI 
SVABAI 
SVABAI 
SVABAI 
SVABAI 
SVABAI 
SVABAI 
SVABAI 
SVABAI 
SVABAI 
SVABAI 
VABSI = 
VABSI= 
VABSI= 
VABSI= 
VABSI = 
VABSI= 
VABSI= 
VABSI= 
VABSI= 
VABSI= 
SVABSI 
SVABSI 
SVABSI 
SVABSI 
SVABSI 
SVABSI 
SVABSI 
SVABSI 
SVABSI 
SVABSI 
SVABSI 
VABWK= 
VABWK= 
VAEWK= 
VABWK= 
VABKK= 
VABWK= 
VABWK= 
VABWK= 
VABWK= 
VABHK= 
SVABWK 
SVABWK 
SVABWK 
SVABWK 
SVABWK 
SVABWK 
SVABWK 
SVABWK 
SVABWK 
SVABWK 
SVABWK 
SVABWK 
SVABWK 
SVABWK 
SVABWK 
SVABWK 
SVABWK 
SVABWK 
SVABWK 
SVABWK 
SVABNO 
SVABNO 



36; 
38; 
40; 
42; 
44: 
= 4^ 
= 48 
= 50 
= £2 
= 55 
= 57 
= 59 
= 61 
= 63 
= 65 
=67 
20 
21 
23 

25 
28 
30 

32 
35 
37 

= 4i 
= 44 
= 46 
= 48 
= 51 
= 53 
= 55 
= 58 
= 60 
= 62 
= 65 

23 

24 

26 

27 

28 

30 

31 

33 

34 

Jit 

= 38 
= 39 
= 4 1 
= 42 
=44 
= 45 
= 46 
= 48 
= 49 
= 50 
= 52 
=53 
= 55 
= 56 
= 57 
= 59 
=60 
= 62 
= 63 
=29 
=30 
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IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

If 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

If 

IF 

IF 

IF 

If 

IF 

If 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

If 

IF 

IF 

IF 

IF 

IF 

IF 



ASVAEAD=5 

ASVA£AI)=6 

AS VAEA0=7 

ASVAEAD=8 

ASVAEAD = 9 

AS VABAD=10 

ASVABAI)=1 1 

ASVABAD=12 

ASVABAD=13 

ASVAEAD=14 

AS VABAD=15 

ASVABAD=16 

ASVABAD = 17 

ASVABAD=18 

ASVABAD=19 

ASVABAD=20 

ASVABAD=21 

ASVAEAD=22 

ASVABAD=23 

AS VABAD = 24 

ASVABAB=25 

ASVABAI) = 26 

AS VABAD = 27 

AS VABAD=23 

ASVABA0=29 

ASVABAI» = 30 

ASVAEEI = 0 

ASVAEEI = 1 

ASVAEEI=2 

ASVAEEI=3 

ASVABEI=4 

AS VABEI=5 

ASVAEEI=6 

ASVABEI=7 

AS VA£EI = 8 

ASVABEI=9 

ASVABEI = 10 

ASVABEI = 1 1 

ASVABEI=12 

ASVABEI = 13 

ASVABEI = 14 

ASVAEEI=15 

ASVABEI = 16 

ASVABEI = 17 

ASVABEI=18 

ASVABEI = 19 

ASVABEI = 20 

ASVABEI=21 

ASVABEI=22 

ASVABEI = 23 

ASVABEI=24 

ASVABEI = 25 

ASVABEI=26 

ASVABEI=27 

ASVABEI = 28 

ASVABEI=29 

ASVABEI = 30 

ASVAENO=0 

ASVAENO=1 

ASVAENO=2 

ASVAENO = 3 

ASVAEN0=4 

ASVAENO=5 

ASVAENO=6 

ASVAENO = 7 

ASVAEN0=8 

ASVAENO=9 

ASVABNO = 31 

ASVABNO=32 



THEN SASVAEAI=26; 

THEN SASVAEAL=29; 

THEN SASVAEAE=31; 

THEN SASVA£AI=34; 

THEN SASVA£A£=36; 
THEN SASVABAD=39 
THEN SASVABAD=41 
THEN SASVABAD=44 
THEN SA£VABAD=46 
THEN SASVABAD=4S 
THEN SASVABAD=51 
THEN SASVABAD=54 
THEN SA£VABAD=57 
THEN SASVABA0=5S 
THEN SASVABA0 = 62 
THEN SASVABAD=64 
THEN SASVABAD=67 
THEN £ASVA3AD=69 
THEN SASVAEAD=72 
THEN SASVABAD=74 
THEN SASVABSD=77 
THEN SA£VA5AD=7S 
THEN SASVABAD=80 
THEN SASVA3AD=80 
THEN SASVABAD=80 
THEN SASVABAD=80 

THEN SASVA£EI=20; 

THEN SASVA£EI=20; 

THEN S ASVA£EI=21 ; 

THEN SASVA£EI=22; 

THEN SASVA££I=24; 

THEN SASVAEEI=26; 

THEN SASVAE£I=27; 

THEN SASVAEEI=29; 

THEN SASVAEEI=31; 



THEN 


SASVAEEI= 


32: 


THEN 


SASVABEI 


= 34 


THEN 


SASVABEI 


= 36 


THEN 


SkSViBlI 


= 37 


THEN 


SASVABEI 


= 39 


THEN 


SASVIBEI 


= 4 1 


THEN 


SASVABEI 


= 42 


THEN 


SASVJBEI 


= 44 


THEN 


SASVABEI 


= 46 


THEN 


SASVABEI 


= 4£ 


THEN 


SASVABEI 


= 49 


THEN 


SASVABEI 


= 51 


THEN 


SASVABEI 


= 53 


THEN 


SASVABEI 


= 54 


THEN 


SASVABEI 


= 56 


THEN 


SASV AEEI 


= 58 


THEN 


SASVABEI 


= 59 


THEN 


SASVABEI 


= 6 1 


THEN 


SASV AEEI 


= 63 


THEN 


SASVABEI 


= 64 


THEN 


SASVABEI 


= 6 6 


THEN 


SASVABEI 


= 66 


THEN 


SASVAENC= 


20; 


THEN 


SASV AENC= 


20 ; 


THEN 


SASVAENC= 


2 1 ; 


THEN 


SASVAENC= 


22 ; 


THEN 


SASVAENC= 


23; 


THEN 


SASVAENC= 


24 ; 


THEN 


SASVAENC= 


25 


THEN 


SASVAENC= 


26 ; 


THEN 


SASVAENC= 


27 ; 


THEN 


SASVAENC= 


28; 


THEN 


SASVAENO 


= 5t 


THEN 


SASVABNO 


= 51 



IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 

IF 



A2VAEKC = 
AFVA£liO= 
A£VAEIiC = 
AEVAEKC= 
A£VAEliC= 
ASVAENC = 
ASVABNC= 
A£7AENC= 
A£VAEUC= 
A£VABNC= 
A£VABNC = 
ASVAENC= 
A£"VAENC = 
A£VABNC= 
A£VAENC= 
ASVABNC= 
ASVA£KC= 
A£VAENO= 
A£ VAEKC= 



12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 

26 

27 

28 

29 

30 



THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 

THEN 



S A£VABNO = 
SA£VABNO= 
SA£VABNO= 
SA£VABNO= 
SA£VABNO= 
SA£VABNO= 
SA£VABNO= 
SA£VABNO^ 
S A£VABNO = 
SA£VABNO^ 
S A£VABNO= 
S A£VABNO= 
SA£VABNQ: 
SA£VABNO^ 
SA£VABNO= 
SA£VABNO = 
SA£VABNO^ 
S A£VABNO= 
S A£VABNO= 



^31 
^32 
■33 
■3‘i 
■■35 
■36 
■31 
■■38 
■■39 
= 40 
= 41 
= 42 
=43 
= 44 
= 45 
= 46 
= 47 
= 48 
= 49 



IF ASVABNO=33 THEN £A£VABNO=52 
IF ASVAENO=34 THEN 3J£VA£NO=53 
IF ASVABNO = 35 THEN 5A3VABN054 
IF ASVABNO=36 THEN £A£VABNO=55 
IF ASVABN0=37 then SA£VABNO=56 
IF ASVABNO = 38 THEN SA£VABNO=57 
IF ASVABNO=39 THEN £A£VAENO=5£ 
IF ASVABNO=40 THEN SA£VABNO=5S 
IF ASVABNO=41 THEN SA£VABNO=60 
IF ASVABNO=42 THEN SASVAENO=61 
IF ASVABNO=43 THEN SASVABNO=62 
IF ASVABNO=44 THEN SA£VAENO=63 
IF ASVABNO=45 THEN SASVABNC=64 
IF ASVAENO=46 THEN £A£VABNO=65 
IF ASVABNO=47 THEN SASVAENO=66 
IF ASVABNO=48 THEN £A£VABNO=67 
IF ASVABNO=49 THEN £A£VAENO=6£ 
IF ASVABNO=50 THEN £A£VABNO=69 



THE FCIICHING STATEHENTS CREATE THE NUMERIC VARIABLE 
•LC£MATHS' FROM THE VAEIAEIE 'INTHSRV*.; 

YE AR=£DE£Tfi {lNGTH£f V, 1,21 ; 

HCNIH=£CESTR (lNGTH£fiV,3, 2) ; 

YE AR£ = YEAR+0 ; 

MCNTH£=MCNTH + 0: 
lC£l!NlH£ = YEAE5*12+i;CNIHS ; 

RECCCING TO A CATEGCRICAL VARIABLE. ; 

IF MElIBfND=10 THEN DEPENDT£=0; ELSE DEPENCTS=1; 

CCNVERTING CHARACTEF VARIAEIES TO NUMERIC.; 

NUEY£AY = HYPAYGRD+C; NO NCI FC=NOTRCMD + 0 ; 

TO EIFINE THE EIGHE£T PAYGRALE ACHIEVED^ ACCORDING 
TO TEE BMIC FILE. ; 

IF IIIEF1G1=8209 TEEN PA YGRAEE=PAYGRDE1 : 

IF U1EFIG1 NE 82CS THEN PA YGRADE=PAYGRllE3 ; 

IF PAYGRADE=0 THEN PAYGR ADE=PAYGRD£1 ; 

IF fAYGEADE=0 THEN PAYGR ACE=' ; 

CREATING THE A£VAB COMPOSITE VARIABLE USED HHEN 
CLASSIFYING AD'S, AND ASSIGNING A DUMMY VARIABLE 
TO IDENTIFY THCSE SEC ACHIEVED THE MINIMUM SCORE.; 

ADCCMICS = SASVABAF+SASVABEI+SASVABGS+SASVABMK; 

IF AICCMFOS GE 19C THEN ADMJNSCR=1; 

ELSE ADMINSCR=0; 

SETTING UP DUMMY VARIABLES TC ALLOW ANALYSIS OF 
RACE AND SEX EFFECTS.; 



IF 


FACE=1 


THEN 


WHITE=1; 


ELSE 


IF 


£ACE=2 


THEN 


BLACK=1; 


ELSE 


IF 


FACE=3 


THEN 


OTEEB=1; 


ELSE 


IF 


SEX =2 


THEN 


N0SEX=0; 


ELSE 



CREATING A RANDOM VARIABLE TC ALLOW THE DATA TO 
BE SPLIT RANDOMLY IN HALF; 



IF EANUNI(O) <= .5 THEN £ANIALL1 = 1; ELSE RANDALL1 = 0; 
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* CBfJIJNG INTER5CTICK VARIAEIES fOR USE IN THE UCDEl 
DEVEICIMENI. ; 

INTEEC1 = ££PENETS*H£DG: 

IKTE£C2=EEPENETS*EIACK; 

INTEEC3=rEPENETS*HCSEX; 

INT£EC4=EEPENCTS*TERMENLT; 

IN IERC5=EEP ENETS* S AS VABA I; 
IKl£fC6=IEPENETS*AIilINSCB; 

I NT £EC7=HSDG* EL AC £ ; 

INI£EC8=ESDG*NUSE2 : 

INT£EC9=HSDG*TERi4£SlT; 

IKT£E10=ESDG*SASVA£AI; 

INTEE 11 = HSDG*ADMIN£CR; 

IKT£E12=ELACK=»NUS£X: 

INT£E 13=ELACK*TEEH£NLI; 

IN1£E14=£1ACK*SASVAEAI; 

INTEE 15=£LACK*ADMIi;SCR; 

IKIEE 16 = NDSEX*TERi!INlT; 

INTEE 17=N0SEX*SASTiAEAI; 

INIEE18=NUSEX*ADilIfSCE: 

INT£E19=lERaENLT*SAS7ABAI; 

IKlE£20 = TERHENLT*AIilINSCE; 

IKT££21 = SASVAEAI*AEilINSCE; 

* THE ECILCEING IINES CREATE E3EEERENT CRITERION 
VAEIAEIES.; 

IE ((SEEVICE1 EQ 21 AND ( (PAYGEADE GE 4) AND 
jNUHYPAY GE 4)1) THEN SUCCPAYG = 1; 

EISE SUCCPAYG=C; 

IF ENTEYYR=78 AND INTEYMTH GE 10 THEN LATEENLr=1; 
EISE LATEENLT=C* 

IE lAEHSI GE 48 OE ’(IAFU£1 GE 45 AND LATEENLT= 1) 
TEEN SUCCTAF=1; ELSE SUCCTAF=0 ; 

IE £LGEEUP1=4 THEN SUCCREUP=0; ELSE SUCCHEUP=1 ; 

IE SUCCREUP=1 AND SUCCTAE=1 AND SUCCPAYG=1 
TEEN SDCC£SS2 = 1; ELSE SDCCESS2=0; 



lABEI 

HSDG =HIGH SCHOOL GRADUA 
EEPENDTS=SINGLE, NO EEPENDE 
CHYEC =CCNVERT£D NEEEER 0 
KUHYfAY =NHEC FILE — HIGHEST 
NUNCTEC =NHEC — NOT RECOMMEN 
PAYGEADE=DI1EC-BA£ED HIGHEST 
£ASVAEGI=STANDAREIZE£ SCORE 
SASVAENC=STANDAREIZEE SCORE 
SASVA£AE=STANDAREIZEE SCORE 
SASVAEWK=STANDAREIZEI SCORE 
SASVAEAE=STANDAREIZEE SCORE 
£ASVAESE=STANDAR£I2£E SCORE 
SASVAEi!K=SIANDAREIZEr SCORE 
£ASVAE£I=SIANDA£EIZ£E SCORE 
SASVAEHC=STANDAEDIZEE SCORE 
£ASVAEG£=SIANDAREIZEE SCORE 
SASVAE£I=SIANDAEEIZEE SCORE 
£ASVAEAI=STANDAREIZEE SCORE 
RHITE ={1) WHITE, EISE (0 

ELACK =n) BLACK, EISE (0 

CIHEE =(1) NEITHER ELACK 

NUSEX =(1) HALE, (C) EEMA 

ADCOMPC£=AE ASVAE COMEOSITE 
ADMIN£CE=AE ASVAE COCECSITE 
EANEAI11=VAE. TO ALICE A RA 
IOSMNTH£=I£KGTH CF SERVICE 
ENTRYGFE=EN1RY GROUP CIASSI 
IATEENI1=ENIERED AFTER SEP 
SUCCTAF =SUCCESS ON ICS CRI 



TE ( 1) . OTHER (0) 

NTS fO), OTHERWISE (1) 

E YEARS OF EDUCATION 
EAIGEADE ATTAINED 
DED FOR RE-ENLISIHENT 
PAY-GRADE ATTAINED 

- GENERAL INFORMATION 
-NUMERICAL OPEEATIONS 

- ATTENTION TO DETAIL 

- aOED KNOWLEDGE 
-AEITHIIETIC REASONING 

- SPACE PERCEPTION 

- BATH KNOWLEDGE 

- ELECTRONIC INFO 

- HECH COMPREHENSION 

- GENERAL SCIENCE 

- SHOP INFORMATION 

- AUTO INFORMATION 

IcR WHITE, ELSE (0) 

IE 

SCREEN 

NDCB 50-50 SPLIT 
IN MONTHS 
EICATIONS 

78 jl) , OTHERWISE (0) 
TEEION (1) 
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SUCCIiYG= (1,0) SCCCESf ON PAYGBADE 

SUCC£EUf=ll ,0l ElIGIEIE TO EEENIIST 

SUCCE££2=SCCCESS ON CCNPOSIIE CEIIEfilON (1) 

INTEEC1=IEPINDTS*HSDG 

INTEBC2=IEPINDTS*B1ACK 

INTEBC3=EZP INDIS+NUSE2 

INTE£C4=£EPENDTS*TERMEN1T 

3NTE£C5=IEPENDTS^SASVA£AI 

INTEBC6=£EPINDTS*ADMIliSCR 

INTEBC7=ES£G*BLACK 

INTE£Ce=ES£G*NOSEX 

INTER C9 = E££G*TEE EE NIT 

INTER 10 = E££G*SASVABAI 

INTER ll = E££G*ADflINSC£ 

INTER 12=E1ACK*NUSEX 
INTER 13=E1ACK*TESMENII 
INTER 14=E1ACK*SA£VABAI 
INTER 15 = El ACK* All KINS Ci 
INTER 16 = N0£EX*TEfiMENlT 
INTER 17 = liD£EX*SASVA3AI 
INTER 18=K0£EX*ADNIN3CR 
INTER 19 = IE£KEN1T*SAS7A£AI 
INTER20=TEEHENLT*ADaiRSCR 
INTER21 = £A£7ABAI*ADMIliSCE; 

/♦ 

// 
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APfENEIX B 



EESCEIPTIVE AMALYSIS HESDLTS 

rreguency distr itutions and correlations used for 
descriptive analysis of tiie AD data set are : ortained in 
Tables XIV and XV. 

lie frequencies slow that S2 percent of the AD data set 
were 17 to 21 years cf age, 79 percent had a high school 
degree, 97 percent were single, and 98 percent were nale. 
Even though BLACK ard OTHEE only represented 17 and 6 
percent cf the sample respectively, their criterion scores 
were significantly different compared to WHITE criterion 
scores. Thus, BLACK and OTBEE emerged as predictors in seme 
cf the models. It is interesting to note that 40 percent of 
the sample achieved tie paygrade E-5. Using achievement of 
E-5 rather than E-4 ir the composite success criterion would 
produce greater variability on the criterion which nay 
improve the models, 

Cne third of the cases in the data did not score ISO or 
greater cn the AE composite score. The'se cases are either 
people wic were classified prior to correcting the ASVA3 
forms 5,6 and 7 misnciming protlems, or people who migrated 
to the AE rating sclseguent to service entry. This may 
partially explain the negative correlations these variables 
have with the criteria. 
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TAEIE Ili 

Selected Preguencies 



itrcPAiP 


FINAL EAIING 
FEEQUENCY 


AS IISIED 
CUM EEEQ 


BY D. M.D 
PERCENI 


.C. 

cua 


PEECEKI 


AE 


2820 


2820 


100.000 




1 CO, 000 


SCPEPK 


SCfiEEN SCORE 
FEEQOEKCY COM FREC 


PERCENT 


CUM 


PEECEKI 


• 

52 


73 

2 


2 


0l073 




Cl073 


c c 


1 


3 


0.036 




C. 109 


57 


c 


8 


0. 1 82 




C.291 


5S 


■ 


1 1 


0. 109 




0.400 


6 1 




12 


0.036 




0.437 


6 2 


6 


18 


0.218 




0.655 


63 


7 


25 


0.255 




0. 910 


6y 




28 


0. 109 




1.C19 


66 


36 


64 


1.311 




2.330 


68 


54 


1 18 


1.966 




4.296 


70 


66 


184 


2.403 




6.698 


71 


24 


208 


0.874 




7.572 


7 2 


145 


353 


5.278 




12.850 


73 


c 


358 


0. 1 82 




13.032 


74 


106 


464 


3.859 




16.891 


75 


50 


514 


1.820 




18.7 11 


76 


37 


551 


1.347 




20.058 


77 


78 


629 


2.839 




22.898 


78 


188 


817 


6.844 




29.742 


7S 


172 


989 


6.261 




36.003 


80 


13 1 


1120 


4,769 




40.772 


81 


105 


1225 


3.822 




44.594 


82 


51 


1276 


1.857 




46.451 


83 


56 9 


1845 


20.714 




67. 164 
7 0. 4 40 


84 


90 


1935 


3.276 




85 


2 


1937- 


0.0 73 




7C.513 


86 


13 


1950 


0.473 




70. 987 


87 


1 1 1 


2061 


4.041 




75.027 


88 


424 


2485 


15.435 




90.462 


8S 


27 


2512 


0.983 




91.445 


90 


208 


2720 


7.572 




99.017 


91 


3 


27 23 


0. 109 




99. 126 


92 


c 


2728 


0.182 




99.308 


93 


4 


27 32 


0.146 




99.454 


S4 


1 


2733 


0.036 




99.490 


95 


14 


2747 


0.5 10 




1 00. COO 


APCIGBPS 


AFQT GFCOPS 
FEEQOEKCY 


{ 5,4C,4B,4A 
CUM FREQ 


,3B,3A,2 

PERCENI 


.1) 

CUl 


PERCENT 


1 


4 


4 


0.142 




0. 142 


2 


6 1 


65 


2. 163 




2.305 


•3 


280 


345 


9.929 




12. 234 


4 


599 


944 


21.241 




33. 475 


c 


795 


1739 


28.191 




61.667 


6 


545 


2284 


19.326 




80.993 


7 


505 


2789 


17.908 




98.901 


8 


3 1 


2820 


1.099 




1 00.000 
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AGE OF INCIVICUAI AT TIME OF ENTfiY 



EKTBYAGE 


PEEQUENCY 


CUM FEEQ 


PEECENI 


CUM PERCENT 


17 


388 


388 


13.759 


13.759 


1 8 


1225 


1613 


43.440 


57. 199 


19 


594 


2207 


21.064 


78.262 


2 C 


263 


2470 


9.326 


87.569 


21 


136 


2606 


4.8 23 


92.4 11 


22 


75 


268 1 


2.660 


95.071 


23 


49 


2730 


1.738 


96.809 


24 


28 


2758 


0.9 93 


97.801 


25 


22 


2780 


0.780 


98.582 


26 


1 1 


2791 


0.390 


98.972 


27 


12 


2803 


0.426 


99.397 


26 


8 


2811 


0.284 


99. 681 


29 


8 


2819 


0.284 


99. 965 


30 


1 


2820 


0.035 


100.000 




ENTRY 


PAY GRACE 


(EOO — 01 1) 




ENTER AYG 


FEEQOENCY 


COM FEEQ 


PERCENT 


CUM PERCENT 


1 


2375 


2375 


84.220 


84.220 


2 


279 


2654 


9.894 


94. 1 13 




166 


2820 


5.887 


1 CC.OOO 



TIFHENIT 



3 

c 

€ 



TEEM OF ENLISTMENT (NO. 
FEEQUENCY COM FEEQ 



2691 2694 

1 2695 

125 2820 



OF YEAES) 
PERCENT CUM 

0.035 

0.035 

95.461 

0.035 

4.433 



PERCENT 

0.035 
C.071 
95. 532 
95.567 
1 CC.OOO 





SERVICE 


OF ACCESSION 


(NAVY, 2) 






ERVACCS 


FREQUENCY 


COM FEEQ 


PERCENT 


CUM 


PERCENT 


2 


2715 


2715 


96 .277 • 




96.277 


8 


105 


2820 


3.723 




100.000 


CONVERTEI NUMBER OF YEARS 


OF ECOCATION 




CEYEC 


FREQUENCY 


COM FREQ 


PEECENI 


CUM 


PERCENT 


3.5 


1 


1 


0.035 




C.035 


6 


4 


5 


0.142 




0. 177 


c 


27 


32 


0.957 




1. 135 


10 


143 


175 


5.071 




6.206 


1 1 


285 


460 


10. 106 




16.3 12 


11.5 


122 


582 


4.326 




20.638 


12 


2165 


2747 


76.773 




97. 4 11 


13 


29 


2776 


1.028 




98.440 


14 


26 


2802 


0.922 




99. 362 


15 


7 


2809 


0.248 




99.6 10 


16 


1 1 


2820 


0.390 




1 CC.OOO 




HIGB-SCBCOL 


GRACUATE (1) 


V. OTHER (0) 




ESEG 


FREQUENCY 


CUM FEEQ 


PEECENI 


CUM 


PERCENT 


0 


582 


582 


20.638 




20.638 


1 


2238 


2820 


79.362 




ICC. 000 
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SINGLE. NC EEPENEENIS 10 ), OIHEBSISE ( 1) 



EEf ENEIS 


FBEQUEKCY 


COM lEEC 


PERCENT 


CUM PEECENI 


0 


2736 


2738 


97.092 


97.092 


1 


82 


2820 


2.906 


1CC.0C0 



(1J MALE, (0) FEMALE. 



KCSEY 


FREQUENCY 


CUM FREQ 


PERCENT 


cua 


PERCENT 


C 


64 


64 


2.270 




2. 270 


1 


2756 


2820 


97.730 




1CC.0C0 




ENTRY GROUP CLASSIFICATIONS 






EKTEYGFP 


FREQUENCY 


CUM EREQ 


PERCENT 


CUM 


PERCENT 


1 


1 166 


1166 


41 .348 




41. 348 


3 


128 


1294 


4.539 




45.667 


"c 


1316 


2610 


46.667 




92.553 


7 


210 


2820 


7.447 




1 CC. OCO 



(1) HEITE, (2) ELACK, (3) OTHEE 



RACE 


FREQUENCY 


CUM EREQ 


PERCENT 


CUM 


PERCENT 


1 


2184 


2164 


77.447 




77.447 


2 


466 


2652 


16.596 




94.043 


- 


166 


2820 


5. 957 




IOC. 000 




AC ASVAB 


CCMPCSITE 


SCREEN 






AICINSCE 


FREQUENCY 


COM EREQ 


PERCENT 


CUM 


PERCENT 


0 


945 


945 


33.511 




33.511 


1 


1875 


2820 


66.489 




ICO. 000 




VAR TO AILCW 


A RANCOM 


50-50 SPLIT 




EANEALI 1 


FREQUENCY 


COM EREQ 


PEECENI 


CUM 


PERCENT 


0 


1380 


1360 


48.936 




48.936 


1 


144C 


2820 


51.064 




ICC. 000 




INTEF-SERVICE SEPARATION CODE 






ISC3 


FREQUENCY 


COM FREQ 


PERCENT 


CUM 


PERCENT 


0 


1106 


1106 


39.220 




39.220 


1 


1495 


26C1 


53.014 




92.234 


6C 


22 


2623 


0.780 




93.0 14 


6 1 


6 


2629 


0.2 13 




93.227 


63 


1 


2630 


0.035 




93.262 


64 


7 


2637 


0.248 




93.5 11 


65 


6 1 


2698 


2.163 




95.674 


67 


1 4 


27 12 


0.496 




96. 170 


71 


7 


2719 


0.248 




96.416 


7 3 


15 


2734 


0.532 




96.950 


74 


1 


2735 


0.035 




96.966 


75 


2 


2737 


0.071 




97. 057 


76 


7 


2744 


0.243 




97.3C5 


78 


22 


2766 


0.780 




98.085 


80 


4 


2770 


0.142 




98.227 


82 


2C 


2790 


0.709 




98.936 


86 


3C 


2820 


1.064 




ICC. OCO 



78 



( 


1) ENTEEII AFTEE SEP 78 


, OTHERWISE 0) 




IMfiNlI 


FEEQUEKCY 


COM FREQ 


PEECENT 


CUM 


PEECEKI 


C 


2 54 3 


2543 


90.177 




90. 177 


1 


277 


2820 


9.823 




1 CC.OCO 


DMDC-BASir HIGHISI PAY- 


GRADE ATTAINED 




f AYGFACE 


FEEQUEKCY 


COM FREQ 


PERCENT 


CUM 


PEECENT 


1 


lie 


1 10 


3.901 




3.901 


2 


108 


2 18 


3.830 




7.730 




23 1 


449 


8.191 




15.922 


4 


1259 


1708 


44.645 




60.567 


c 


1 1 10 


2818 


39.362 




99.929 


6 


2 


2820 


0.071 




ICC. COO 




NHSC FIIE — HIGHEST PAYGRADE ATTA 


I NED 




tUHSPAS 


JEEQUEECY 


COM FREQ 


PERCENT 


CUM 


PEECENT 


1 


17 


17 


0.603 




0. 603 


2 


95 


1 12 


3.369 




3.972 




220 


332 


7.801 




1 1.773 


4 


1401 


1733 


49.681 




61.454 


C 


108 7 


2820 


38.546 




100.000 




SUCCESS 


ON ICS CRITERION (1) 






SOCCIAf 


FEEQOEECY 


COM FREQ 


PERCENT 


CUM 


PEECENT 


0 


393 


393 


13.936 




13.936 


1 


2427 


2820 


86.064 




1 CC.OCO 




HIGH PAYGRADE SUCCESS CEITERIO 


N. 




SCCCFAYG 


FEEQOEKCY 


COM FREQ 


PERCENT 


CUM 


PEECENT 


0 


474 


474 


16.809 




16.809 


1 


234€ 


2820 


83.191 




ICC. 000 


• 


REEKLISIEENI 


EIIGIEIIITY CRITERION. 




SCCCBEUE 


FEEQUEKCY 


CUM FREQ 


PERCENT 


CUM 


PEECENT 


0 


293 


293 


10.390 




1C. 390 


1 


2527 


2820 


89.610 




1 CC. OCO 




SUCCESS CN 


CO MPCSIIE 


CRITERION 


(1) 




SCCCESS2 


FEEQUEKCY 


CUM FREQ 


PERCENT 


CUM 


PEECENT 


0 


655 


655 


23.227 




23-227 


1 


2165 


2820 


76.773 




ICC. OCO 


MONTHS OF ICTL 


. ACTIVE FED. MILIT. 


SER V 




TAIMS 1 


FEEQUENCY 


COM FREQ 


PERCENI 


CUM 


PEECENT 


2 


1 


1 


0.035 




C.035 


7 


4 


5 


0. 142 




C. 177 


8 


2 


7 


0.071 




0.248 


c 


€ 


13 


0.213 




0.461 


10 


1 


14 


0.035 




0.496 


1 1 




17 


0.106 




0.603 


12 


8 


25 


0.284 




0.887 


15 


9 


34 


0.319 




1.206 


14 


C 


39 


0.177 




1.383 


15 


C 


44 


0.177 




1.560 



79 



16 


S 


53 


0.319 


1.879 


17 


c 


58 


0. 177 


2.057 


18 


C 

V 


67 


0.319 


2.376 


IS 


1 1 


78 


0.390 


2.766 


20 




81 


0.106 


2.872 


2 1 


C 


86 


0. 177 


3.050 


22 


c 


91 


0.177 


3.227 


2 j 


1C 


101 


0.355 


3.582 


24 


s 


1 10 


0.319 


3. 901 


25 


c 


115 


0.177 


4.078 


26 


10 


125 


0.355 


4.4 33 


27 


8 


133 


0.284 


4.7 16 


28 


7 


140 


0.243 


4. 965 


2S 


c 


145 


0. 1 77 


5. 142 


3C 


2 


147 


0.071 


5.2 13 


3 1 


7 


154 


0.248 


5.461 


32 


4 


158 


0.142 


5. 603 


" - 


c 


163 


0.177 


C C A 


34 


4 


167 


0.1 42 


!’ 922 


•3 c 


- 


170 


0.106 


6.028 


36 


7 


177 


0.248 


6.277 


37 


1C 


187 


0.355 


6.631 


38 


7 


194 


0.243 


6. 8 79 


3S 


c 


1S9 


0.177 


7.057 


40 


7 


2 06 


0.243 


7.305 


4 1 


6 


2 12 


0.213 


7.5 18 


42 


6 


218 


0.213 


7.730 


43 


c 


223 


0.177 


7.908 


44 


4 


227 


0.142 


8.050 


45 


76 


303 


2.695 


10.745 


46 


9 1 


394 


3.227 


13.972 


47 


248 


642 


8.794 


22.766 


48 


1113 


1755 


39.468 


62.234 


4S 


97 


1852 


3.440 


65.674 


5C 


9 1 


1943 


3.227 


68.901 


5 1 


1 1 1 


2054 


3.936 


72. 837 


52 


56 


2 1 10 


1.986 


74.823 


c; - 


5 1 


2161 


1.809 


76.631 


54 


6 1 


2222 


2.163 


78.794 


55 


57 


2279 


2.021 


80.8 16 


56 


55 


2334 


1.9 50 


82.766 


57 


48 


2382 


1.702 


84.468 


58 


34 


2416 


1.206 


85.674 


c c 


57 


2473 


2.021 


87.695 


60 


53 


2526 


1.879 


89.574 


6 1 


59 


2565 


2.092 


91.667 


62 


53 


2638 


1.879 


93.546 


63 


36 


2674 


1.277 


94.823 


64 


30 


2704 


1.064 


95.887 


65 


16 


27 2 0 


0.567 


96.454 


66 


22 


2742 


0.780 


97.234 


67 


13 


27 55 


0.461 


97.695 


68 


20 


2775 


0.709 


98.404 


6S 


1 5 


2790 


0.532 


98.936 


70 


1 1 


2801 


0.390 


99.326 


7 1 


17 


2818 


0.603 


99.929 


72 


2 


2820 


0.071 


1 00.000 
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TABLE 2V 

Selected Correlations 





IAPMS1 


SUCCTAF 


S0CCESS2 


AICTGIfS 


-0.08930 

0.0001 


-0.06108 

0.0012 


-0.05137 

0.0064 


AFCaiCNT 


-0.077 55 
0.0001 


-0.05346 

0.0045 


-0.04182 

0.0264 


EMlfiXiGE 


0.055 18 
0.0034 


0.02593 
0. 1 686 


0.056S8 

0.C025 


ENaPfAKG 


0.03578 

0.0575 


-0.00926 

0.6230 


C. 01083 
0.5653 


TIIfjEKlI 


0. 14720 
0.0001 


0.02116 

0.2614 


0.05189 
0. 0059 


CBIEC 


0.07522 

0.0001 


0.07554 
0.000 1 


0. 11471 
O.OOCl 


HSIG 


0.099 18 
0.0001 


0.12117 

0.0001 


0. 15525 
C.OOOl 


NDSEX 


0. 08426 
0.0001 


0.04868 

0.0097 


0.01204 

0.522S 


HEIIE 


-0. 10983 
0.0001 


-0.06035 

0.0013 


-0.04767 
0.01 14 


ElACK 


0. 10220 
0.0001 


0. 05290 
0.0050 


0. 02641 
0. 16C8 


CTEEE 


0-03329 

0.0771 


0.02342 

0.2139 


0.04265 

0.0235 


SCEEEN 


-0.00478 

0.8022 


0.07461 

0.0001 


0. C68S1 
O.OOCl 


Aiccaics 


-0.05463 

0.0037 


-0.02132 

0.2578 


0. 00440 
0.8153 


AEEIBSCE 


-0.07581 
0.00 01 


-0.02971 
0. 1 147 


-0.02934 

0.11S2 


SASVAEAE 


0.00263 

0.8888 


0.01025 
0. 5864 


0.01356 
0.47 17 


SASVAIAI 


-0.06941 

0.0002 


-0.03415 
0. 0698 


-0.00654 

0.7285 


SASVAEAB 


-0.051 63 
0.006 1 


-0.02568 
0. 1727 


-0.01734 

0.3574 


SASVAEEI 


-0.031 40 
0.0955 


-0.01303 

0.4893 


0.01707 

0.3648 


SASVAIGI 


-0.01535 

0.4152 


-0.02107 

0.2633 


-0.00708 

0.707C 


SASVAEMC 


-0.06570 
0. 000 5 


-0.04088 

0.0300 


-0.02361 

C.2101 


SASVAEI3K 


-0.03166 

0.0928 


0.00698 
0.71 12 


0.02268 

0.2245 
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SiiSVAZNC 


-0- 038 6S 
0.039S 


-0.01504 

0.4246 


-0.00541 

0.7741 


SASVAISI 


-0. 03834 
0.041 8 


-0.00090 

0.9618 


-0.C0367 
C. 8370 


SA£VAiSf 


-0. 04660 
0-0129 


-0.02609 

0-1660 


-0.C0735 
0. 6964 


SASVAiGS 


-0.03464 
0.065 9 


-0.02632 
0. 1622 


-C. 01036 
0.5822 


SASVAiHK 


-0.06134 
0.001 1 


-0.05225 

0.0055 


-0.04763 

0-0111 


INIZBOl 


0. 05262 
0.0052 


0.03656 

0.0523 


0.04814 

0.0106 


011602 


0.05869 
0.00 18 


0. 02943 
0. 1 182 


0-04022 

0.0327 


IN1IBC3 


0. 07253 
0.0001 


0.03665 

0.0516 


0-04666 

0.0132 


INlEBCil 


0.062S7 

0.0008 


0.03358 

0.0746 


0-04590 

0.0148 


OIEBC5 


0.06033 
0.00 13 


0.0310 1 
0.0996 


0. 04114 
0-0289 


IN1IBC6 


0.01791 
0.34 17 


0. 02303 
0.2215 


C.022S9 

0.2222 


01EBC7 


0.096 19 
0.0001 


0-05711 

0.0024 


0- C3899 
0-0384 


01EBC8 


0. 11852 
0.0001 


0.12665 

0.0001 


0. 14906 
0.0001 


IN1EBC9 


0. 12282 
0.00 01 


0-11899 

0.0001 


C. 16000 
0.0001 


OIEBIO 


0-07658 

0.00C1 


0. 10700 
0.000 1 


0. 14937 
0. 0001 


IN1EB11 


0.00621 
0.74 16 


0.04010 

0.0332 


0.06994 

0-0002 


IN1EB12 


0. 10322 
0-00 01 


0.05193 

0.0058 


0.02540 
0. 1775 


INIEB 13 


0. 109 65 
0.0001 


0.05469 

0.0037 


0. 02934 

0.1193 


INIEB 14 


0. 10565 
0-0001 


0. 05537 
0. 0033 


0.032 18 
0-0876 


IN1EB15 


0.060 99 
0.00 12 


0.03034 
0. 1073 


0. 00764 
0-6852 


IN1EB16 


0. 14975 
0.00 01 


0.05095 
0- 0068 


0.03845 

0-0412 


INIEB17 


-0.00842 

0.6550 


0.00025 

0-9896 


0.00160 

0-9240 


IK1EB18 


-0.051 98 
0-0058 


-0.01436 
0. 4460 


-0.02373 

0.2077 
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IKIEB 19 



0.00629 

0.7384 



IMEE20 

IN7EE21 



•0.05623 
0.00 28 

•0.08291 

0.0001 



-0.02271 

0.2280 

-0.02818 
0. 1347 

-0.03788 

0.0443 



C.C1637 

0.3849 

-O.C 

0 . 

-0. 02863 
0.1266 



Ncte; lie first rnjmber is the correlation be: ween 
the predictor and the criterion, the second 
number is the significance level. 
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APiENEIX C 

BEGBIESION INAIISIS PSOGBAMS 



E€cr€ssion aialysis atteufts to predict or explain the 
values cf tie criterion variable with one or moc e predictor 
variables- The following sections expand upon the discus- 
sion cf regression analysis presented in Chapter 17. 

A. f fCOlEEMEBTS AND iSSOaPIICKS 

Hter ccrducting regression analysis, certain reguire- 
ments aust be met or assumed. One of tiiese requirements is 
the use cf quantitative variables. s Application of regres- 
sion procedures also requires normality (the value of the 
dependent variable must be normally distributed at eacn 
value of the independent variable), homoscedas ticity (the 
variation around the regression line must be constant for 
all values cf the independent variable), and independence of 
error (the residual difference between an observed and 
predicted value cf the dependent variable must be indepen- 
dent fcr each value cf the predictor variable) . Another 
requirement of linear regression is that a straight-lire cr 
linear relationship exist between each independent variable 
and the dependent variable. For purposes of this study, and 
based cr initial irvestigaticn, these requirements are 
assumed to be met. Ecwever, an extensive effort tc evaluate 
these assumptions by transforming the variables or employing 
complex statistical analysis packages has not been 
conducted. 



-The inclusion of qualitative or categorical variables 
in regression models may be accommodated through the use of 
dummy 'variables. 



E 



SHESISE BEGBESSICN 



lie EAS Stepwise process considers each of the candidate 
independent variables for inclusion in the model by deter- 
nininc the contribution the variable makes to the icdel. 
Ihis deteriiina tion is ac cem plished by calculating the 
partial E statistic fer the variable, and adding it tc the 
model if it meets the specified entry significance level- 
After a variable is added, the stepwise method then Iccks at 
all the variables in the model and deletes any variable that 
does net previde an E statistic sufficient to meet the spec- 
ified significance level for remaining in the model. Ihis 
process of adding ard deleting variables continues until 
none cf the variables has an F statistic significant to 
enter cr leave the mcdel.® [Bef. 12] 



C. IIBEIB BEGBESSIOB 



Simple linear regression is concerned with finding the 
statistical model or eguaticn that best "fits" the original 
data. Ihis is accomplished by defining a straight line that 
minimizes the differences between the actual value of the 
dependent variable and the value that would be predicted 
from the fitted line of regression. The SAS Regression 
procedure uses a mathematical technigue, the le ast-sguaxes 
method, tc produce such an eguation for the best linear 
model. Ihis eguaticn provides the intercept and slcpe of 
the sample predictor variable. With multiple linear regres- 
sion, these slopes represent the unit change in the depen- 
dent variable per unit change in the independent variable, 
taking into account the effects of the other independent 
variables, and are referred to as net regression coeffi- 
cients. Ihe sample regression coefficients of the predictor 



used the SAS Stepwise default significarce 
level cf ,15 for variables to enter or remain in the model. 



variatl€S are then used as 
laticx faraaeters. lor il 
validate Model A is provide 



estimates of the respective fOfu- 
lustration, the program used to 
d ir Taole XVI. 
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TAIIZ XVI 

Sample Validaticu Program 



//AEVllJI JCB (2807,C110),'D CSLDND, SMC 1 763 • , CL A2 £=E 
C£G = NPGV«1. 2807P 
// EiiC £A£ 

y/FIlEIN EE DI£P=£HE,D£N=M££. £2807.ADA1L4 
/ySY£3N ED =» 

CPTICKS 1£=€0 NOCENIEf: 



* IHI£ PfOGPAIl CAICUIAIES TEE VALIDITY CF A 
MCEEl THECUGH THE C£E OF C£0£S- VALIDATION 
CEC£S-VAIIEAIICN TICHNIQOE£.; 

DATA lATAl:' 

£ET EILEIN. ADALL4 : 



REGRE SSICN 
AND DCUEIE 



THE BANDCN VABIAELE CREATED IN 'ADNEWVAR* 
TO £PIIT THE DATA APPfiOXI M AT EL Y IN HALF. 
TEE DERIVATION SAMilE AND ’VALIDA' IS THE 
VAIIEATICK SAMPLE- : 



IS NOW USED 
•EERIVA' IS 
HCLD-OUT OR 



EATA lEBlVA: 

SET EilAl: 

IE EAKIAIL1 = 1; 

EATA VIIIEA; 

SET EATAI: 

IF EANEAILI = 0; 

* A EICCK REGRESSION IN NOW RUN ON DERIVA TO COMPOTE ANE 
OUTPUT THE PARAMETER ESTIMATES (BETAS) THAT RESULT 
FRCM TEE REGRESSION. THE BETAS ARE WRITTEN TO THE DATA- 
SET WCEK-EETAD. TEE MODEL IS GIVEN THE LABEL 'TAFMHATV; 



PROC EEG EATA=DEEIVA CUTES T=EETAE : 

TAEMHATV:MODEL TAIKSl = ADMINSCR TERMENLT DEPE NDTS EIACK 

HSDC- OTHER NUSEX / STE; 

TITLE REGRESSING ON EERIVA; 

* THE NEXT STEP IS TC APPLY THE REGRESSION FORMULA (THE 

BETAS) TO THE DATA IN THE VALIDATION SAMPLE AND CAICUIATE 
THE PEEEICTED SCORE FOR EACH CASE IN VALIDA. THE PEEE- 
ICTEE SCCEES AEE WRITTEN TC WORK. PR EDI AF V . SAS USES THE 

AS THE VARIABLE NAME FOR THE VALIDA 
SCORE PROCEDURE PRODUCES NO 

PRINTED OUTPUT.: 



MOEEI lABEL (TAFMHATV) 
PREEICTEE SCORES. THE 



PROC SCCFE CUT=PREDTAFV TYPE=01S SCORE=BETAD 

PfiFPTCT* 

VAR aSiNSCR lERHENLI DfePENDIS BLACK 
HSDG OTHER NOSEX: 



* THE FIRST VALIDITY COEFFICIENT IS NOW CALCULATED BY FINI- 
ING TEE CCRRELATION BETWEEN VALIDA’S ACTUAL SCORES ANE 
VALIDA *S PREDICTED SCORES.; 

PROC CCER E ATA=PREET AFV: 

VAR TAFMS1 TAFMEATV; 

TITLE FIRST VALIDITY COEFFICIENT.; 
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* NCt) IC fiEEEAT IHE EBOCESS IC OIILIZE THE DOUBLE CfC 
VAIJrATIC^ TECHNigCi. THIS TIME A BEGEESSICN 15 ?.0 
ON ViXJEA AND IHE BESULTING EETAS (BETAV) ABE USED 
PREDICT ItE SCORES CE IHE CASES IN DERIVA. DERIVA' 

ACIUAI AND PREIICIII SCORES dEE THEN CORBEIAIED 10 
PINE TEE SECOND VAIIDIIY COEEfICIENT. ; 

BEOC BEG D5IA=VAIIDA CUT£ST=EEIAV : 

IAEMHAID:MODEI TAIES1 = ADHINSCR TERHENLI DEPENDIS ElACK 

ESDG OTHER NUSEX / SIE; 

TITLE REGRESSING ON VAIIDA; 

BEOC SCCBE COT=PEEDTAED TYPE=C1S SCORE=BETAV 
rMA = DERIi;A PBEDICT: 

VAB AIMIKSCR lEENEMT DEPENDIS BLACK 
HSDG OTHER NUSEX; 

PEOC CCER r ATA=PEEET5ID: 

ViE IAEMS1 lAPMEAID: 

TITLE SECOND VAIIDIIY COEFFICIENT; 

/* 

// 
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DISCBimUANT ANALYSIS PBOGEAMS 



LiscriainaDt Analysis allows observations to be cl 
fied into two ox eoi 6 groups on the basis of one or 
numeric variables. Ihe following sections expand upo 
discussion of discrininant analysis presented in Chacte 
lor illustration, lable XVII shows the program use 
produce the classification matrices for the derivatic 
valicaticr samples fcr Model A. 



A. BICOIBihENIS AND ASSOMPIICNS 



As was the case with regression analysis, iiscrin 
analysis also reguires that certain basic assumption 
met. hirst, the chservaticns in the data set shcu 
members cf twc ci more mutually exclusive gr 
Iherefore, the groups must he defined so that each case 
belong to only one group. Another statistical pro 
required of discriminating variables is that they may n 
linear ccmbinaticns cf other variables. Thus, the s 
average of several variables may not be used along 
those variables. There are three other assumptions 
considered. The population covariance matrices mus 
equal fcr each group, each grcup is to be drawn from a 
laticn which has a multivariate normal distribution, 
discriminating variables must be measured at the interv 
ratio levels. Ideally, these variables will be contin 
but they need not be. [Bef. 17] This study assumes 
requirements have beer met. However, an effort to eva 
these properties was rot conducted since, in practice, 
discriminant analysis technique is rather robust and 
tolerate some deviation from: these assumptions £Bef. 18 
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f. L3SCEIBINANT ANAIliSIS 

lie first step cf discr iminart analysis is to weight ard 
linearly ccrhine the discr irirating variables so that the 
groups will be as statistically distinct as possible. Ihe 
derived equations, called discriminant functions, ccntine 
the crctp characteristics using a measure of generalired 
squared distance'^ that will allow one to identify the crcup 
to which a case telcncs or most closely resembles. 

Ihe classif icaticn process may assume that membership in 
a group has equal liJtelihood cf occurring. However, it nay 
be mere desirable tc incorporate the prior probability of 
group membership intc the classif ication function tc improve 
prediction accuracy or minimize the cost of prediction 
errors. In this studj, membership in a success group was on 
the order cf 80 percert. Therefore, it was appropriate to 
consider prior probabilities so that those cases predicted 
as unsuccessful would be classified as such only if strong 
eviderce exists that they belong there. 

Ihe ultimate concern in developing a mathematical model 
is that it predict well or provide a reasonable description 
cf the real world. Cnee a model is developed which provides 
satisfactory discrimination for cases of group membership, 
classification functions may be derived and applied tc the 
classi f ication cf new cases with unknown group membership. 
A good test of the adequacy and accuracy of the discriminant 
model is the percentage of correct classifications, ccmmorly 
called the ’’hit-rate". Ihis test is accomplished by 
applying the classif ication function to the known cases from 
which the function was derived. The percentage of correctly 



■'The procedure ccnducted a likelihood ratio test of 
homogeneity of the within-group covariance matrices for each 
model. This test was s£ atistically significant for each 
model. Therefore, the within-group matrices were used as 
the basis cf the measure of generalized squared distance in 
develcping the classificati cn criterion. [Bef. 12J 
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classiii€d cases provides an indication of tiie accuracy of 
the procedure and indirectly cenfirms the degree cf greup 
separation. The results may he depicted in a classification 
ratri . 

Khen the sample size is large enough, as it is it this 
study, a further check of the classification accuracy may be 
conducted by randomly splitting the sample into two subsets. 
The classif icaticn function is derived on one subset and 
validated on the otter subset. A comparison of the two 
hit-rates provides the measure of accuracy of the mcdel. 
£fief, 17* 
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TAE1£ XVII 

Sample Discrimisart Analysis Program 



//DISCfGtS JOB (2807 -C110) , ’E CSLUND, SMC 1763' ,3 LA£S = E 
//♦MAIN CEG=NPGVM1 . 2fcC7P 
// £X£C SA£ 

y/FIl£IN EE D ISP=SH£ ,DSN=M SS. S2807 . ADA1L4 
//SYSJN EE * 

CPTIC^£ 1£=80 NOCENIIE: 

* THIS PDEPCSE OF THIS PEOGEAM IS TO ALLOW THE Vi LIDIIY 
OF A EISCEIMINANT KCEEL TO E£ INVESTIGATED. A CLASS- 
IFICATION FUNCTION IS DERIVEE FROM THE DERIVA SAMPIE 
ANE THIS FUNCTION IS USED TO CLASSIFY THE CASES IN IHF 
VAllEATICK (Ofi HCLE-OUI) SAMPLE. THE TWO ClASSI F IC AT 10 N 
MAIEJCIES Afi£ THEN USED TO AILCW THE 'HIT RATE' ON EACH 
SAEEIE TO BE CALCULATED. ; 

lATA IATA1: 

SET FIIEIN. ADA1L4 ; 

* USING TEE EANDCM VAFIAELE TC SPLIT THE SAMPLE APPfiCXIt- 
ATEIY IN HALF. ; 

DATA EEEIVA: 

SET EATAl: 

IF EANEA111 = 1 ; 

EAIA VAIIEA: 

SET EATAl; 

IF fiANEAlil=0; 

* CAICUIAIING THE Cl ASSIFIC ATIC N MATRIX FOR DERIVA ANE 
WRITING OCT THE CL ASSIFIC ATIC N FUNCTION DERIVED FROM 
DERIVA TC HORK.D. ; 

PROO EJSCRIM DATA=DEFIVA OUT=E POOL=TEST; 

CLASS SCCCTAF: 

VAR DEPENDTS HSDG BLACK TERMENLT 

KCSEX OTHER ADMINSCR; 

PRICES PROPCRTIONAI: 

TITLE EISCRIM ON DERIVA.; 

* NOW TEE CIASSIFICATICN FUNCTICN FROM DERIVA IS USEE TC 
CLASSIFY THE CASES IN VALIDA.; 

PROC EISCRIM DATA=D TESTDAT A=VALIDA ; 

TESTCIASS SUCCTAF * 

TITLE‘'dE£IVA" S FUNCilCN APPLIED TO VALIDA.; 

/* 

// 
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ailllTY ANALYSIS PBOGfiAMS 



liis a^fendix picvides further details of the irfcrna- 
tion ccrtaired ir Chajter V, acd gives examples of the SAS 
programs acd outputs. 



A. CAICCIAIION OF C 

Ihe method used 
studj deperds od w 
model is heing evalu 
simply as a fcrmu 
whereas a discrimin 
predicted successes 
difference, the ca 
complicated for re 
models . 



Ill PBOEAEILITIES 

tc calculate cell probabilities in this 
hether a regression or a discriminant 
ated. A regression model can be viewed 
la for calculating predicted scores, 
ant model actually classifies cases as 
Cl predicted failures. Because cf this 
Iculaticn of cell probabilities is mere 
gressj.on models than for discriminant 



1 • Eeq ress i cn Models 

A regression model and the data from which it was 
developed provide irfermatior on the predicted and actual 
scores fer each case. In order to classify these cases. into 
the four selection outcomes, the cut score on the predictor 
and the score cn tie criterion above which people are 
considered to be successful must be known. If the criterion 
is constructed as a dichotemeus (success/fail) variable, 
then the cases assigned a value of ''one” are considered 
successful and those with a value of "zero” are considered 
unsuccessf ul. If the criterion is a continuous variable 
(such as length cf service) then a value on the scale mrst 
be chosen as the dividing line between success and failure. 
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3he cicice cf the cut score is net such a simple matter, and 
cannot te arhitrarily assigned as can the distinction 
tetween success and tail. Ihe choice of the cut sccre, as 
mentioned before, often depends on the desired selection 
ratio. In the absence of information on the desired selec- 
tion ratio, cell piciabilities are calculated for each of 
many possible cut scores, and a cut score is eventually 
choser based on which set cf cell probabilities maxiaizes 
the utility of the mcdel. In a data set containing actual 
and predicted scores, different sets of ceil probabilities 
can b€ calculated if each predicted score is considered to 
be a cut sccre. Table XVIII contains five pairs of actual 
and predicted scores which will be used to illustrate tne 
method. 



TABIE XVIII 

Illustrative Actual and Predicted Scores 



Actual Criterion 
Score 



Predicted Criterion 
Score 



44 

45 

46 
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47 

49 
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procedtre. 

The columns are the 
■J50 cases in this hypothe 
predicts that 300 of the 
criterion, and it predicts 
a ”one" cn the criterion, 
the cases. 250 people actu 



ual ICS of 48 months or mere 
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TAEIE XIX 

111 ustr atire 0 iscriminant Example 
Predicted 

0 1 Tot al 

0 100 150 250 

Actual 

1 200 300 500 



Total 300 450 



750 



500 people actually scored ’‘one" (successes) , Because, iu 
effect, the di scrininaut prccedure chooses its ewe cut 
score, the four cell proba tilities can be derived directly 
from the output. The predicted "ones" are people that the 
model classifies as accept. Of these 450, 150 actually 

failed sc they are false positives, and the remaining 300 
were successful, so they are valid positives. Of the 300 
cases that the mcdel would have rejected (predicted 
"zercs") , 100 were failures (valid negatives) and 200 were 

successes (false negatives) , Again the cell prohahilities 
are feund hy dividirc each ccunt by the number of cases. 
Therefore, PVP = 300/750, PIP = 150/750, PFN = 200/75C and 
PVN = ICC/750. For a discriminant model, there is cnly cne 
set of cell probablities to he calculated. 



E. ESllhATION OF CEII OTILIIIES 



In erder to calculate 
utilities associated with e 
estimated, "Although the 
cutcemes may very well be 
theory, it is not a prob 
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Ideally each selection 
estimated utility. Eecaus 
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utilities fcr each outcome (p articularlj for the false and 
valid necatives), relative utilities are estimated. It is 
apparent that a person who is correctly selected (valid 
positive) has a positive worth to the organization. A 
reascnaile estimate of this worth is the marginal product of 
the etplcyee. In this study it is assumed that the navy 
compensates sailors at the full value of their marginal 
product, and the Billet Cost Model provides an estimate of 
the cost to the Kavy of staffing a billet £Bef. It]. 
Because relative utilities are the issue at this time, the 
utility of a valid positive (01) is assigned the value of 
+ 1 . 

It is a reasonable assumption that the utility of a 
false positive is a negative number. As the employee was 
not judged to be successful, his marginal product was prob- 
ably less than the marginal cost to keep him in the job. In 
addition a poor performer may adversely affect the perform- 
ance and productivity of his fellow employees, and when he 
leaves, additional expense is necessary to find a replace- 
ment. Cn the other hand, it is unlikely that a poor 
performer does net contribute anything to the organization, 
and thus it is obviously difficult to estimate the magnitude 
of the disutility of a false positive. In this study a 
minor form of sensitivity analysis is undertaken to circum- 
vent this estimation difficulty, and expected overall utili- 
ties are calculated for three different relative values of 
false positive utility (02) . These values are -.5, -1, and 
a relatively extreme assumption, -2. 

The disutility of a false negative is also difficult to 
estimate, partly because it is not known what happens tc the 
applicant after he is rejected. If the Navy rejects an 
applicant tc the AD rating but accepts him in another rating 
where le is subseguertly successful, then his disutility 
could be reasonably argued to be zero. If, however, the 
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Kavy rejects him altogether when he would have been 
successful if selected, then the costs of attracting and 
testing him are wasted and additional costs are reguired to 
attract and test another applicant. These costs will depend 
cn the state of the recruiting marJcet at the time. If there 
are aany gcod Quality applicants then the disutility of 
rejecting a potentially successful applicant may be snail. 
Again, as a type of sensitivity analysis, three relative 
values fcr the utility of a false negative (U3) are consid- 
ered: 0, -.25 and -.5. 

It is net obvious that any utility should be assigned to 
D4, the utility cf a valid negative. Ihe person would have 
failed anyway, so nothing was gained by rejecting him. 
however, when viewed from an economist's viewpoint in rela- 
tion tc opportunity costs, the fact that the person was 

correctly rejected means that the organization did net have 
to bear the cost of incorrectly accepting someone who turns 
cut tc be unsuccessf ul. Thus, correctly rejecting an appli- 
cant is cf equal and opposite utility to incorrectly 
accepting him. Therefore, D4 = -U2. 

Ihe use of relative utilities is a convention to 

simplify the estimation of cell utilities. In the abeve 
discussion relative utilities are estimated on the basis 
that the utility of a valid positive is +1. However, the 
values of t1 through D4 that are used in the formula for 
overall expected utility, (Sguation 5.1), need tc be 

expressed in actual dcJlars. As mentioned above, the Eillet 
Cost eodel is used tc estimate the utility of a valid posi- 
tive. Ihe standard manyear cost of an E-4 Aviation 

tachirist's Mate is 324,163. This cost comes from financial 
year 1983 data and represents the total cost to the Navy of 
creating and filling a jot slot over one full year. 

[Bef. 16] A utility of +1 is therefore equivalent to 

♦$24,163, a utility cf -.5 will be -$12,082, and so cn. 
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SBCGIAHS USEE TO CAICULAIE UTILITIES 



As icsitioned in Ssctio 
cell f xctaiilities for a 
tedicus and repetitive pro 
three saajle programs used 
cf a model. Explaratory 
each set cf SAS statements 
computes tie predicted cri 
writes tie results cut to 
XXI shews part of tie outp 
second program's main purpo 
the cell pr chahilities tha 
predicted score were used 
abilities are written out t 
program also calculates th 
cf cell utilities and outpu 
result (Table XXIII) . The 
lates tie utilities for 
utilities. 



As explained before, 
results from a discriminan 
gained from the discriminan 
to calculate the expected u 
and these calculations were 



n A. above, the calculation of 
regression model is a fairly 
cedure. This section certains 
to calculate the expected utility 
comments are provided fcllcwing 
. The first program (Table XX) 
terion score for each case and 
a file called "RTYHATA”. Table 
ut from the first program. The 
se (Table XXII) is to calculate 
t wculd result if each different 
as a cut score. The cell preb- 
0 a file called "ETUTIIA”. The 
e expected utilities for one set 
ts the 30 largest utilities that 
third program (Table XXIV) calcu- 
six different sets cf cell 

only one set of proiaiilities 
t mcdel and these can be readily 
t output. No programs were used 
tilities of a discriminant model 
dene by hand. 



E. CAICCIATION CF BARE LINE DIIIITIES 

As described in Chapter V, the utility of the Navy's 
original selection strategy (the base line utility) needs to 
he calculated in erder for comparisons to be made. 
Observation 4 in Table XXIII demonstrates that when all the 
cases are accepted (41.0774 is the lowest predicted sccre) , 
the selection ratio is obviously 1 and PVP = .860636 (which 
is the base rate) and PFP =1 - PVP = . 139362. No one is 
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rejected/ therefore PfN acd EVN are zero. I he expected 
utility under these circumstances is: 

EO = .860638($24/ 16i) + , 1 59362 (-$ 12 ,082) ♦ C + 0 = 315,112 

As Table XXIII shows, the maximum utility occurs when 
the cut score is slightly higher than the lowest predicted 
score (there are five cases with a predicted score cf less 
than 43.26S2 in Table XXI). This maximum utility (319,135) 
is -12 percent greater than the base line utility of 
3 19, 1 12. 
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TABU XX 

First Ctilitj Aralysis Program 



//SEItllll JOB (2840,C104) , 'SEl CLARK, SaC 1 560' , C1A£S = E 
//♦MAIN CRG=NPGVM1 . 2840P 
// EXEC ‘^A*^ 

//FIIEIN EE BISP=SHE,DSN=MSS.S2807.AEA1L4 

y/FIlECDI EE UNI'I=33 20 V, MSVGP=P0E4A,DISP= (NEW , CAT LG , EEIEI E) , 
// DSE=MSS.S2840.RIYHAIA, 

// ECB= (ELKSIZE = 640 0) 

//SYEIN EE ♦ . 

CPTICKE 1E=£0 NOCENIEE; 

* IBE EOEPCSE OF THIS EEOGEAM IS TO CALCULATE THE PEEEICTED 
SCCRE FOR EACH CASE (USING IHE MODEL EEVELOPEE PRE- 
ICESll), AND 10 WRITE OUT TEE ACTUAL ANE PREEICTEE 
SCCRE5 TC A FILE IF MASS SICRAGE. ; 

EATA EATA1; 

SIT EILEIN. AEALL4 ; 

RENAME TAFMS1=Y; 

♦ REFAEING THE CRITERION VARIABLE; 

PROC REG EATA=DATA1 CCTEST= BETAS ; 

YHATlMCEEl Y = 

DEPENETS ESDG ELACK OTHER NUSEX TERMENLT AEMINS CR / STB; 
TITLE BLCCK REGEESSICN TO OUTPUT BETAS.; 

PROC SCCRE C0T=PREEY TYPE=OLS SCORE=BETAS DAIA=DATA1 PREDICT 
VAR EEPENETS HSDG HACK OTHER NUSEX TERMENLT AEMINSCR; 

♦ CALCULATES THE PREEICTEE SCORES, ANE WRITES THEM TC 
EATASET 'PREDY*. 

NCTE: THE SCORE PROCEDURE TAKES THE MODEL LABEL (YEAT) 

AND USES TEAT LABEL AS THE VARIABLE NAME FCR TEE 
PREEICTEE SCORE. ; 

EATA PEEEY2: 

SIT PREEY; 

KEEP YHAT Y SOCCTAE; 

PROC SORT EAIA=PREDY2 CUT=FILECUT. HTYHATA; 

BY YHAT; 

* 'CRTS TEE OUTPUT FILE IKTC ASCENDING YHAT ORDER, 

AFiE WRITES OCT TEE SORTED DATA TO MASS STCRAGE, ; 

EATA TE ^ T • 

SET EIllcCI. RT5HATA; 

IE N LE 10 OR ( N GT 1270 AND N LE 1280) 

~ “ OR _U_~GT 2790; 

PROC PRINT EATA = TEST SPLIT=*; 

LABEL Y=ACTCAL*CFIIERICN*£CCRE 

YHAT=PREEICTII*CRITERLCN*SCORE 
SUCCTAE=SUCCESS C N*CRITERIC N ; 

TITLE TEE FIRST 10, MIDDLE 10 AND LAST 30 OBS OF RTYHATA; 
TITLE2; 

TITLES NCTE: SORTED IN ASCENDING ORDER OF YHAT-; 

PROC UNIVARIATE EATA=IILEOUT. RT IHATA PLOT; 

VAR YHAT Y SUCCTAE; 

TITLE STATS OF THE ACTUAL AND PREDICTED CRITERION SCCEES; 
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TABXE XXI 

fartial Output from the First Utility Program 



FIBST 


10, MIIDIE 


10 AND LAST 30 


OBS OF RIYKATA 


NOTE: 


SOEIIi: IN 


ASCENDING ORDER OF YHAT. 


CEE 


aCTUAI 


SDCCESS ON 


PREDICTED 




CRITEBION 


CEITEEION 


CRITERION 




8COEE 




SCORE 


1 


45 


0 


41.0774 


2 


23 


0 


41.0774 


" 


66 


1 


42.0297 


4 


12 


0 


42. 0297 


c 


16 


0 


42.8960 


6 


2 1 


0 


43. 2692 


7 


1 1 


0 


43.2692 


£ 


54 


1 


43.2692 
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48 
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43.2692 


10 
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1 


43.2692 


1 1 
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5 1 


1 


48.7601 


13 


47 


1 


48.7601 


1 4 


48 


1 


48.7601 


15 


65 


1 


48.7601 


16 


7 1 


1 


48.7601 


17 


48 


1 


48.7601 


18 


33 


0 


48.7601 


IS 


48 


1 


48.7601 


2C 


4S 


1 


48.7601 


ii 


48 


1 


56.8469 


2 2 


51 


1 


56.8469 


2 3 


58 


1 


56.8469 


24 


59 


1 


56.8469 


25 


52 


1 


56.8469 


26 


7C 


1 


56.8469 


27 


5 1 


1 


56.8469 


28 


60 


1 


56.8469 


2 S 


4S 


1 


56.8469 


30 


5C 


1 


56.8469 


3 1 


56 


1 


57.7992 


32 


6C 


1 


57.7992 


' “ 


54 


1 


57.7992 


34 


27 


0 


57.7992 


:: c 


36 


0 


57.7992 


36 


54 


1 


57.7992 


37 


52 


1 


57.7992 


38 


63 


1 


57.7992 


^ c 


6 £ 


1 


57.7992 


40 


54 


1 


57.7992 


4 1 


51 


1 


57.7992 


42 


66 


1 


57.7992 


43 


64 


1 


57.7992 


44 


64 


1 


57.7992 


45 


72 


1 


57.7992 


46 


6 3 


1 


58.0774 


47 


64 


1 


59. 1512 


48 


52 


1 


59.3928 


4S 


62 


1 


6 1.21 15 


50 


46 


1 


61.21 15 
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TAEIE XXII 

Second Dtilitj Analysis Program 



y/SEIClII2 JOB (28« 0 . C 104) , • SEI CLARK, SMC 1 560' , Cl ASS= E 
//♦MAIli CEG = NPGVfl1.2t^J0P 
// EXEC SAS 

y/SA£.fiCEK ED SP ACE= KH, ( 1 2,4) ) 

//FIIEIN EE DISP=SHE,DSN=MS£. S2840.RIYHATA 

y/FIIECUI EE U NII=3330V, MSV GP=PU B4A, DISP= ( NEW ,CATIG , EEL El £) , 
// D£N=MSS.S284C-fiIUTILA, 

y/ ECB= (ELKSI2E = 640 0) 

y/SYSIN EE * 

CPTICES 1£=80 NOCENIIf; 

* THE PUEPCSE OF THIS PROGRAM IS TO WRITE OUT A FILE TO MASS 
STCEAC-E WEIGH CONTAINS THE VALUES OF PVP, PfP, PEN ANE PVN 
THAT RESOll WHEN EACH PREEICTED SCORE IS USED TO SEPARATE 
THE CASES INTO ACCEPT AND REJECT GROUPS (IE- OUTPUT THE 
CEII PfCEABILITIES THAI RESUIT WHEN EACH PREDICTEE SCCRE 
IS tSEI AS A CDTTINC- SCORE). 



THE 



INPUT FILE 
CASES) ARE 



CONTAINS 3 VARIABLES, AND THE OBSERVATIONS 
SORTIE IN ASCENDING ORDER OF 'YHAT'. YHAT 

DEVELOPED EARIIER) OF 

EACH CASE, 'Y* IS TEE 'ACTUAL ICS IN MONTHS AND 'SUCCTAE* 

IS A DUMMY VAEIAEII WHERE EACH CASE IS CATEGORIZED AS A 
SUCCESS (1) OR AS A FAILURE (0).; 



lOE 

IS TEE PREDICTED ICS (FROM TEE MODEL 



DATA IATA1: 

SIT EIIEIN. RTYHATA; 
DRCP Y; 

RENAME SUCCTAE = Y: 



THE DATA IS READ IN AND THE ACTUAL LOS IN MONTHS VARIABLE 
IS ERCPPEE AND THE LUMMY VARIABLE IS RENAMED 'Y'.; 



PROC SUMMARY 
VAR Y: 

CUTPCT CCT=DATA2 



DATA=DATA1; 

SCM=NSUCC N=NCASE, 



* HERE TEE NUMBER OF SUCCESSIUl CASES IN THE DATA (NSUCC) IS 
FOUND BY SUMMING THE 1'S AND O'S IN VARIABLE 'Y'. ANCTEER 
VARIAEIE 'NCASE' IS CREATED WHICH IS THE NUMBER Of CASES 
IN TEE DATA. THESE TWO VARIABLES (EACH A SINGLE NUMBER) 
ARE WRITTEN TO DATA SET WCRK.DATA2. ; 

LATA T AT * 

V iQ 1 then sit DATA2; 

NEAII~= NCASE-NSUCC; 

SET EATAl; 

* THE VARIABLES NCASE, NSUCC AND NFAIL (THE NUMBER OF UNSUC- 
CESSfUI CASES IN TEE DATA) ARE ADDED TO DATAl- NCASE, 
NSUCC AND NFAIL ARE EACH SINGLE NUMBERS THAI ARE REPEAIED 
FOR EACH CBSERVATICN. EG- NCASE IS A COLUMN OF SCO'S 
(SAY), NSUCC IS A CCIUMN OF 325'S (SAY) AND THEREFCEE 
NFAII IS A COLUMN CE 175'S.; 
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03= -6041; 04= 12032; 



lATA IAIA4; 

CJTA3* 

01= 2tl63^ 02= -12C82; 
fiEOAIi; K2ZE0 0; 

EEIAIK lASTYHAT 0; 

II YEAI NE lAITYHAl THEN LINK CAICS; ELSE IIHK ZEECS 
IE Y=C lEEN NZEEC = K2E£0+ 1 ; 

1A£IYEAI=YHAT; 

EEIOEN; 

CAIC5: VE = N£OCC-( N -1-NZESO); 

EE = NEAIL-E:ZE50; 

FN = N -1-KZEEO; 

VN = 'HZEEC 



Dill = (OliliP + 02*fE + 
£EATIO = (VE + FP) /NCASE; 
SDCCBAIE = ’BE/(VP+fE); 



03*fN + 04*VN) / NCA£E ; 



ZEECS; 



EEIOEN 
VE = 0 
OIIL ='0 
EEIOEN ; 



EE = 0 
£EAIIO 



= 0 ; 



FN = 0 i 
SOCCEATE 



= 0 ; 



VN = C; 



F 


CF 


E 


(OB 


G 


THE 


W 


ILL 


40 


1ST 



THIS IS TEE HEAET CE THE EEOGEAN WHEEE SOBTIE LOGIC IS 
EHEICYEL. »NZEBO» IS A COONIEB WHICH COONIS THE NOMEEF 
O'S IN THE »Y' VAEIABIE DCWN TO AND INCLOCING THE USE 
OBSEEVAIICN) CONTAINING THE ’COBEENT’ COITING SCOEE. 

FOE EYAHEIE, IF THEBE ABE 150 ZEEOS AND 250 ONES AMONG 
FIESI 400 OBS. OF 'Y', THEN IHE 400TH OBS. CF 'NZEEC* 

BE 150. IF THE 401ST OBS. CF *Y' IS A ZEEO THEN TEE 
OBS CF 'NZEBO' Will BE 15 1. TO CONTINOE IHE EXAMPLE. 
BECADSE TEE INPOT £AIA IS SCETED IN ASCENDING OEDEB OF 
•YEAI* TEE 400 CASES PEECEDING THE 401ST CASE (WHICH IS 
THE COBEENT COITING SCOEE), fiCOLD ALL BE CLASSIFIED AS 
EEJECI BECAUSE THEIB PBEDICTED SCOEE IS LESS THAN IHE 
CDIIING SCOEE. THE 400TH OBS. OF 'NZEEO' TELLS US HOW 
MANY CF lEESE EEJECIED CASES WEEE FAILOEES AND IHEEEFCBE 
VN = NZEEC. 'NFAIl* IS THE TOTAL NOMEEE OF CASES THAI 
FAIIED, lEEEEFOEE NiAIL-VN (SAUE AS NEAIL-NZEEO) = FF. 

THE KOMBEE OF CNES IN THE EEJECIED 400 CASES (FN) IS IHE 
COEBENI CBS. (401), MINOS 1, MINOS THE NOMBEE OF ZEECS, OB 
FN = 401-1-150 = 25C. FINAllY, ‘NSOCC* IS IHE ICIAL 
NOMEEE CF SOCCESSES, THEEEFOEE NSOCC-FN IS IHE VAIOE CF VP, 



•LASIYEAI* IS OSEB TO PEECLUEE ANY EEEOES THAT WOOIL EE 
GENEEAIEI WHEN TWO CE MOBE VAIOES OF »YHAT' AEE IDENIICflI. 
IF lEE NEXT POIENTIJL COITING SCOEE IS IHE SAME AS IHE 
PEEVICDS CNE, THEN NC CELI EBOEABILITIES, ETC AEE CALCOI- 
AIED, AND ZEEOS ABE ASSIGNED. 

NOIE; DUE TO IHE CSE OF IHE KEYWOED ’EEIAIN* IHE VAIOES 
CF NZEEO AND LASTYHAT USED IN THE CAICUIAIICNS AND 
IN THE FIfiSI 'IF' STATEMENT AEE THE VALUES FBCfi IHE 
PEEVICUS OBSEEVAIICN. 

THE DATA STEP ALSO INITIAIIZES A SET CF INDIVIDUAL CEIL 
UIIIIIIES (01 - 04) AND CAICDIAIES THE OVEEALL UIIIIIY 
ASSCCIAIED WITH EACH CUTTING SCOEE. ALSO TEE SELECIICN 
EAIIC AND THE SUCCESS BATE EESULTING FEOM EACH CUTTING 
SCCBE AEE CALCULATII.; 

DATA IATA5; 

^ E I E ill 4 • 

PVF = VF/SCASE; FFP = FP/KCASE; 

PEN = FN/NCASE; FVN = VN/NCASE: 

KEEP YHAI UTIL PVF FFF PEN FVN SEATIO 
EEKAME YEAT = CSCCBE; 
lABEI 

CSCCBE=CDT SCOEE ON FBEDICTCE; 



SOCCEATE; 



* CCNVEEIING THE CEIl COUNTS TO FEOBABIIITIES. 
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lEOC 'CEl I£IA=EATA5 CUI-FIIEC DI . £T UT IL A ; 

£Y li'CEKDING UTIl; 

* SOEIIHG £Y UTIl BEICfE WEITIfiG CUT THE PEEV10U51Y KEPI 
VAEIiEIES TO A FILE IN MAES STOEAGE.; 

lATA EAIA6: 

SEI FllECUT.EIOTIIi ; 

IE _N_ IE 30; 

EEOC IE3KT IATA=EATA€: 

3ITLE TEE 30 LAEGEST CTILITIES IN THE FILEC0T-; 

IIT1E2; 

TITLE3 TEE EASE UTILITY IS 19112, AND THE; 

TITLE^l; 

TITLES EASE LINE SUCCESS BATE IS 0- 8606; 

iEOC IICT LATA=EATA6; 

PICT DHL * CSCOEE = ' + • / VEEF =19112; 

TITLE TEE TCP 30 UTILITIES ILCTTED AGAINST CUillNG SCOSE.; 
TITLE2; 

TITLES NCTE: THE HCEIZ. LINE IS THE BASE LINE UTILITY,; 

TITLEt); 

TITLES IE. THE UTILITY EESUIIING FEOM THE NAVY"S; 

TITLE6; 

TITLE/ CEIGINAL SELECTION STEATEGY- (19112); 

IEOC ELCT IATA=EATA6: 

PICT DHL * SEATIC = • + • / YEEf =19112; 

TITLE TEE TCP 30 UTIIITIES PLCTTED AGAINST SELECTION EATIC. 
TITLES; 

TITLES NCTE; THE HOEIZ. LINE IS THE BASE LINE UTILITY,; 
TITLEiJ; 

TITLES IE- THE UTILITY EESUIHNG FROM THE NAVY"S; 

TITLES; 

IITIEI CEIGINAL SELECTION STEATEGY. (19112); 

EEOC ELCT LATA=FILECCT.ETUTILA: 

PICT CHI ♦ SEATIC = • + • / VEEF =19112; 

TITLE ELCTTING ALL UTILITIES AGAINST SELECTION EAHO.; 
TITLES; 

TITLES NCTE: THE HOEIZ- LINE IS THE BASE LINE UTILITY,; 

TITIEU; 

TITLES IE. THE UTILITY RESULTING FEOM THE NAVY"S; 

TITLES; 

TITLE7 CEIGINAL SELECTION STRATEGY. (19112); 

/* 

// 
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1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

25 

26 

27 

28 

29 

30 



lABlE XXIII 

fartlal Output from the Second Otility Pra>grai 



lEE 30 LABGEST UTIIIIIES IN THE fILEOUI. 

THE BASI OTILITY IS 19112, ANE THE 
EASE LIM SUCCESS BATE IS 0. 8606. 

CSCCEE DTIl SBiSTIO SCCCBATE PVP PE? PEN PVN 



43.2692 


19 135 


0 .998 


0.862 


42.C297 


19129 


0 .999 


0.861 


42.8960 


19127 


0 .998 


0.862 


4 1.0774 


191 12 


1-CCO 


0.861 


43.8483 


18998 


0 .988 


0.864 


44.2215 


18988 


0 .988 


C.863 


44.6648 


18865 


0 .982 


C. 863 


43.C879 


18855 


0.982 


0.863 


45.6261 


1879 1 


0.980 


0.863 


46-C401 


18780 


0.979 


0.863 


46.5683 


18748 


0.978 


0.863 


46.6815 


166 14 


0.856 


0.877 


47.279C 


1659 1 


0.855 


C.877 


47-5206 


16514 


0.851 


0.877 


48.2312 


15725 


0.807 


0.883 


48.387C 


157 19 


0 .805 


C.884 


48.7601 


15535 


0.799 


0.883 


4S-33S2 


6 169 


0.395 


C.896 


4S.47C8 


5961 


0.383 


0.899 


49.5373 


53 00 


0.359 


C.896 


49-7124 


5289 


0.358 


0.896 


4S.98C6 


1834 


0.217 


0.898 


5C.4231 


1 756 


0.213 


C.899 


5C.5788 


1298 


0.193 


0.904 


5C- 9329 


195 


0. 146 


0-910 


5 1.5310 


182 


0.145 


C.S12 


4 1.C774 


0 


O.COO 


0.000 


42. 0297 


0 


O.COO 


0. 000 


43.2692 


0 


O.COO 


0.000 


43.2692 


c 


O.COO 


0.000 



0.860 


0.138 


0 . 001 


C.COl 


0.861 


0.139 


0.001 


0. 001 


0.860 


0.138 


0. CC1 


C. C01 


0-861 


0-139 


0 . occ 


c.ooo 


0.8 53 


0.135 


0. 0C7 


C- C05 


0.853 


0.135 


0.008 


0. 005 


0.848 


0- 134 


0.012 


C.C05 


0.848 


0. 134 


0.013 


0. COS 


0-846 


0. 134 


0.015 


C.C05 


0.845 


0. 134 


0. 015 


0. 005 


0.844 


0. 134 


0.016 


C.005 


0.751 


0. 105 


0. 1 10 


C. 034 


0.750 


0.105 


0.111 


C. 034 


0.747 


0.105 


0. 114 


0. 035 


0.713 


0.095 


0. 148 


C. C45 


0.712 


0.094 


0. 149 


0. 046 


0.705 


0.093 


0. 155 


C. C46 


0.353 


0.04 1 


0.507 


C. 098 


0.345 


0.039 


0.516 


C. 10 1 


0.322 


0.037 


0. 539 


C. 102 


0.321 


0.037 


0.539 


C. 102 


0.195 


0.022 


0.666 


0. 117 


0.192 


0.022 


0.669 


C. 118 


0.1 74 


0.018 


0.686 


C- 121 


0. 133 


0.013 


0. 727 


C- 126 


0.133 


0.013 


0.728 


C. 127 


0.000 


0.000 


o.occ 


C. COO 


0.000 


0.000 


0.000 


C. 000 


0.000 


0.000 


0 . 0 0 C 


c. coo 


0.000 


0.000 


O.OCC 


C. 000 
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TABLE XXIV 

Third Otilitj Analysis Program 



y/SEICIII3 JOB / 28h 0 - C 10 4) , » S£1 CLAfiK, SMC 1560', C1A££ = £ 
//♦MAIN CEG = NPGVM1 .2eilOP 
// EXEC SAS 

//FIIEIN El DISP=SHE,DSN=M£S. S2840.RTUIILA 
//SYS3N ID ♦ 

CPTICNS I£=£0 NOCENTEE; 

♦ TBIS EECGEAM EXPICEES THE EFFECTS OF USING DIFFEEENI CELL 
UIIIIIIES FOR THE CAICUIATICN OF OVERALL UTILITY.; 

TATA TAT^I* 

SET FIlilN.ETUTILA; 



0 1A= 1 


; 02A= 


-.5 ; U3A= 


0 


U4A= 


•5 ; 






Ul£= 1 


; U2E= 


-1 ; U3E= 


0 


U4B = 


1 ; 






010= 1 


; 02C= 


-! ; U3C= 


0 


U4C = 


2 ; 






Ult= 1 


; 021= 


-.5 ; 03 E = 


-.5 


U4D= 


.5 ; 






U1E= 1 


; U2E= 


-1 ; U3E= 


-.5 


U4E= 


1 ; 






0 1F= 1 


; U2F= 


-! ; U3F= 


-.5 


U4F= 


2 ; 






UTILA= / 


;evp*oia 


+ EFP*02A + 


PFN*U3A + 


PVN*U4A| 




24163; 


UTILE= 


PVP+UIB 


+ EFP+U2E + 


PFN*U3B + 


PVN*U4B) 


* 


24163 ; 


0TI1C= i 


EVP *0 1C 


+ PFP+U2C + 


PFN*U3C + 


PVN+U4C) 


♦ 


24163 ; 


UIILD= ( 


[EVP*U1D 


+ EFP*U2I + 


PFN*U3D ♦ 


PVN*U4D) 




24163 ; 


UT3LE= j 


PVP*U1E 


+ PFP*U2E + 


PFN*U3E + 


PVN*U4E) 


* 


24163 ; 


UTILF= 


EVP + UIF 


+ EFP*U2F + 


PFN*U3F ♦ 


PVN*U4F) 




24163 ; 



LROC SCET IATA=BATA1 CUI=FI£ST; 

EY EESCENDING UTIIA; 

LATA FIEST: 

SET FIRST; 

KEEi CSCCRE PVP PFF PFN fVN SRATIO SUCCEAIE UTILA; 

IE N IE 30; 

EROC I^ITT; 

TITLE EASE UTILITY IS 19112 AND BASE SUCCESS RATE IS .8606; 
TITLE!; 

TITLE3 01= 1 , U2= -.5 , U3= 0 , 04= .5 .; 

EEOC ELCT IATA = FIRSI: 

PICT CIIIA * SEATIC = *+• / VEEF =19112; 

EEOC SCET £ATA=DATA1 COT=SECCND; 

EY lESCENDING UTIIE; 
lATA SECCNE; 

SET SECOND; 

KEEP CSCCRE PVP PIE PFN PVN SRATIO SUCCRATE UTILE; 

IF N IE 30; 

EROC E'EI'KT ; 

TITLE EASE UTILITY IS 17428 AND BASE SUCCESS HATE IS .8606; 
TITLE!; 

TITLE! 01= 1 , U2= -1 , 03= 0 , U4= 1 .; 

EROC EICT IATA=S£C0NI; 

PICT CTILB * SRATIC = •+• / VEEF =17428; 
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EEOC SCEl IEIA=DATA1 CUI=THIED; 

EY EESCEKDING OTIIC; 
lATA IHIED; 

£E1 IEIE£; 

KEEP CSCCRE PVP PEE PEN EVN SRATIO SUCCEAIE UrilC; 
IE N IE 30 ; 

EEOC ETI'K'I; 

IITLE EASE OIILIIY IE 14061 AN! EASE 
1IT1E2; 

IITIE* 0 1= 1 , 02= -2 , 03= 0 

EEOC EICI EAIA = THIEE: 

PICI CIIIC * SBATIC = •+• / VREF = 



SOCCESS RATE IS 
/ 04= 2 
14061 ; 



EROC EOEI IATA=DATA1 COT=FOOEIH; 

EY rEECEKDING UTIIE; 

EATA ECDEIH: 

SEI ECOEIH: 

KEEP CSCCRE PVP PEE PEN EVN SRATIO 
IE N IE 30; 

EROC ETITI; 

TITLE EASE OTILIIY IE 19112 ANE EASE 
TITLE2; 

IITLE3 01= 1 , 02= -.5 , 03= -.5 

EEOC EICI IATA=FCURTE: 

PICT CTIIE * SEATIC = •+• / VREF = 

EROC SORT £ATA=DATA1 COT=FIFTH; 

EY EEECEKDING OTIIE; 
lATA EJETE: 

SET EIEIE: 

KEEP CSCCRE PVP PEE PEN PVN SRATIO 
IF N IE 30; 

EEOC EIITT; 

TITLE EASE UTILITY IE 17428 AN! EASE 
TITLE2; 

TITLES 01= 1 , 02= -1 , 03= -.5 

EROC EICT £AIA=FIFTB; 

PICT CTIIE * SEATIC = •+• / VREF = 

EEOC ECRT EATA=DATA1 COT=SIXTH; 

EY EEECEKDING OTIII; 

EATA EIYTE; 

SET SIXT H ' 

KEEP CECcJe PVP PEE PEN EVN SRATIO 
IF N IE 30; 

EROC mil; 

TITLE EASE UTILITY IE 14061 ANE EASE 
TITLE2 ; 

TITLES 01= 1 , 02= -2 , 03= -.5 

EROC EICT £ATA=SIXTH; 

PICT OTIIF ♦ SBATIC = •+• / VREF = 



SOCCRATE OriLD; 

SUCCESS RATE IS 
, 04= .5 .; 

19112; 



SOCCRATE UTILE; 

SUCCESS RATE IS 
/ 04= 1 
17428; 



SUCCEAIE UTILE; 

SUCCESS RATE IS 
, 04= 2 . ; 

14061 ; 



/* 

// 



. 8606 



.8606 



.8606 



. 8606 
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